MK

Hello, I'm

Mohan Venkata Pavan Sai Teja Kattiboyina

Advancing Healthcare Through Optimization & Data Science

Healthcare Research Collaborator | Optimization & ML

I develop ILP/MIP optimization models for healthcare scheduling, codify clinical policies into reproducible constraints, and apply machine learning to broader research problems with measurable impact.

Seeking full-time healthcare research roles

Research Assistant · Health Informatics · Health OR · Clinical Analytics

About

About Me

Healthcare-focused research collaborator bridging optimization, ML, and clinical operations.

Healthcare Research Collaborator

I'm a research-driven analyst applying quantitative methods to healthcare operations and scheduling. I currently collaborate with a university faculty member on a healthcare scheduling optimization study, codifying clinical policies into ILP/MIP constraint families and contributing toward publication.

My broader work spans applied ML, simulation, and OR — from RL control to large-scale BI systems — brought back to healthcare problems where rigor, fairness, and reproducibility matter most.

Health InformaticsHealthcare ORClinical AnalyticsHealth AI
Education

MS, Business Analytics & AI

The University of Texas at Dallas

Expected May 2027 • GPA 3.7/4.0

  • Coursework: ML, Big Data Analytics, Statistical Modeling.
  • Research collaboration on optimization and scheduling.

B.Tech, Electronics & Communication

IIIT Nagpur

Graduated 2022

  • Walking-robot RL simulation in MATLAB/Simulink — 95% accuracy.
  • Foundations in programming, signal processing, mathematical modeling.

13+

Modeling Assumptions

Codified for healthcare scheduling

5

Constraint Families

ILP/MIP models for publication

20+

Papers Synthesized

Literature review depth

95%

RL Model Accuracy

Walking robot, 5,000+ iterations

40%

Simulation Speedup

Automated DL pipeline

20%

Capacity Optimization

Peak-hour metro operations

98%

Data Alignment

Weather-flight pipeline

30%

Faster Reporting

BI dashboard automation

Toolbox

Technical Skills

A breadth of tools across analytics, ML, and operations research.

Python

Expert

Data analysis, ML, automation

SQL

Expert

Schema design, complex queries

Machine Learning

Advanced

Predictive models, classification

Tableau

Advanced

Visualization, storytelling

ETL Pipelines

Proficient

Integration, automation

Excel

Advanced

Pivots, macros, modeling

R

Proficient

Statistical analysis, viz

MATLAB

Proficient

Numerical computing

C++

Proficient

Algorithms, data structures

Streamlit

Proficient

ML/data web apps

Stata

Proficient

Econometrics, health data

Credentials

Certifications

Foundational training supporting work in healthcare analytics, clinical data, and operations research.

View Certificate

Python Data Analysis for Healthcare

LinkedIn Learning

2025 · Certificate of Completion

Python
Certificate

Google Analytics Certification

Google

2026 · Self-paced

Analytics
Certificate

Excel for Business Analysts

LinkedIn Learning

2025 · 2h 41m

Excel
Certificate

MySQL Data Analysis

LinkedIn Learning

2025 · 4h 30m

SQL
Certificate

Intermediate R

DataCamp

2024 · 6 hours

R

Work

Research & Featured Projects

Healthcare scheduling research alongside broader ML and optimization work.

Healthcare Scheduling Optimization (Research)

Faculty-led academic research translating clinical scheduling policies into ILP/MIP optimization models with cost, coverage, and fairness KPIs.

Problem

Hospital units face complex staff and resource scheduling problems where coverage, rest rules, and fairness must be balanced against cost — but policies are rarely codified in a way that supports reproducible optimization research.

Solution

Codified 13+ modeling assumptions and an evaluation plan, established a public-data acquisition workflow with a QA'd variable dictionary, and translated 16 policy items into 5 ILP/MIP constraint families (coverage, rest/consecutive, assignment, fairness, soft-coverage); synthesized 20+ papers to position the work for publication.

Impact
  • 13+ modeling assumptions codified for reproducible research
  • 5 ILP/MIP constraint families spanning coverage, rest, fairness, soft-coverage
  • 95%+ completeness in variable dictionary and QA checklist
  • ~30% faster alignment across revision cycles, ~25% rework reduction
  • 20+ papers synthesized into a structured literature review
ILP/MIPOperations ResearchPythonMathematical ModelingHealthcare AnalyticsReproducible Research

AI Nutrition Companion: Real-Time Restaurant Meal Recommendation

Streamlit-based AI nutrition assistant that proactively guides users to healthier meal choices when eating at restaurants, using personal profile, daily intake, and menu data.

Problem

People trying to follow a healthy diet struggle when eating out — most calorie trackers only log meals after eating and do not proactively guide users before they order at a restaurant.

Solution

Built and deployed a Streamlit web app that estimates daily nutrition targets from demographics, activity, and goals, analyzes remaining calories/protein/fiber/sodium against meals already eaten, and ranks restaurant menu items with a rule-based engine classifying options as Light, Medium, or Heavy with clear explanations.

Impact
  • 10+ restaurant categories supported (La Madeleine, Chipotle, CAVA, Panera, Chick-fil-A, etc.)
  • 20+ common menu items scored across calories, protein, fiber, sodium, sugar
  • 3-tier classification (Light / Medium / Heavy) for fast decisions
  • Context-aware recommendations using demographics, activity, goals, meal history, time of day
  • Deployed on Streamlit Cloud with modular app.py / data.py architecture
PythonStreamlitPandasRule-Based RecommendationNutrition ScoringStreamlit CloudDigital Health

Brain Tumor Detection Using Image Segmentation

Healthcare AI project applying preprocessing and segmentation workflows to identify tumor regions in medical imaging data.

Problem

Manual review of medical images can be time-intensive, and tumor-region localization requires consistent preprocessing and segmentation support for clearer analysis.

Solution

Applied image preprocessing and segmentation techniques to isolate suspected tumor regions in brain scan imagery, strengthening practical exposure to healthcare AI, biomedical image analysis, and statistical evaluation workflows.

Impact
  • Built healthcare-focused image segmentation workflow for tumor-region identification
  • Applied preprocessing steps to improve scan consistency before analysis
  • Strengthened medical imaging, healthcare AI, and statistical analysis experience
  • Positioned as a foundation for future research publication
PythonImage ProcessingImage SegmentationHealthcare AIMedical ImagingComputer Vision

Walking Robot using Reinforcement Learning

Walking robot simulation using MATLAB and Simulink, leveraging Reinforcement Learning to optimize control systems.

Problem

Complex locomotion control requiring intelligent optimization of robot movements across multiple trajectory patterns.

Solution

Developed walking robot simulation in MATLAB/Simulink with Reinforcement Learning for control optimization, and engineered an automated deep learning pipeline for neural network training and testing.

Impact
  • 95% accuracy after training over 5,000+ iterations
  • 40% reduction in simulation time through pipeline automation
  • 4 movement trajectories mastered (straight line, circle, square, rectangle)
  • Optimized control systems for enhanced robot performance
MATLABSimulinkReinforcement LearningDeep LearningNeural NetworksControl Systems

Metro Operations Optimization

Data-driven optimization system for Delhi Metro operations using Python analytics techniques.

Problem

Inefficient metro routes, unbalanced capacity utilization, and extended wait times affecting passenger experience across 200+ stops.

Solution

Analyzed 7+ datasets (stops, trips, routes, schedules) using Pandas and NumPy, developed spatial-temporal algorithms for stop density and trip intervals, and visualized hub density with Matplotlib and Seaborn.

Impact
  • 20% boost in peak-hour capacity (6-10 AM, 4-8 PM)
  • 10% reduction in off-peak trips optimizing resource utilization
  • 15% reduction in average passenger wait times
  • 10% improvement in overall metro system efficiency
PythonPandasNumPyMatplotlibSeabornOptimization AlgorithmsStatistical Modeling

Flight Weather Delay Analysis

Comprehensive analysis of flight delays correlated with weather patterns to identify key delay drivers and predict disruptions.

Problem

Airlines facing significant operational and financial losses due to weather-related flight delays without predictive insights.

Solution

Integrated 240+ GSOD weather station CSVs with 500K+ FAA flight performance records using Python (Pandas, Seaborn), automating data ingestion and transformation workflows.

Impact
  • 98% accurate date-station alignment in weather-flight mapping pipeline
  • Quantified +7.5 min delay impact under heavy rain conditions
  • 22% higher seasonal delay spikes identified in winter months
  • Top 10 reliable airports with 30% lower average delay vs. national mean
PythonPandasSeabornNumPyData VisualizationStatistical AnalysisJupyter

Maven Movie Retail Analytics

SQL-driven business intelligence project analyzing movie rental operations to optimize inventory, customer engagement, and revenue.

Problem

Movie rental business lacking data-driven insights for inventory management, customer segmentation, and stakeholder engagement.

Solution

Delivered comprehensive reports by joining 16+ tables mapping store managers, inventory, and stakeholders across 10+ locations with advanced SQL aggregation of 1,000+ films and transactions.

Impact
  • Enhanced store-level planning visibility across 10+ locations
  • Identified rating-based inventory distributions and top-value customers
  • 12% increase in repeat customer retention through improved segmentation
  • Streamlined board engagement by linking transactions, advisors, and awards
SQLMySQLDatabase DesignData AnalysisBusiness Intelligence

SBA Loans Default Prediction

Machine learning model to predict loan defaults for the Small Business Administration loan program.

Problem

High default rates impacting SBA loan program efficiency requiring accurate risk assessment and real-time scoring capabilities.

Solution

Engineered 15+ analytical features (loan-to-asset ratio, processing efficiency), trained an XGBoost classifier optimized for AUCPR on imbalanced data with early stopping and threshold tuning, and deployed via Gradio on Hugging Face Spaces.

Impact
  • AUCPR of 0.472 achieved on imbalanced loan data
  • Real-time loan scoring capability via Hugging Face deployment
  • Enhanced model transparency using SHAP and permutation importance
  • Reusable scoring pipeline for stakeholder insights
PythonXGBoostScikit-learnGradioSHAPHugging FaceFeature Engineering

Conagra Brands Meat Substitute Analysis

Data-driven revival strategy for Gardein plant-based products with market insights and growth recommendations.

Problem

15% decline in traditional meat sales (2020-2023) with rising plant-based alternatives requiring strategic market positioning.

Solution

Analyzed a 4-year sales dataset (~$50B market), conducted hypothesis testing revealing 25-40 age-group preferences, applied regression for price elasticity, and designed 10+ market experiments.

Impact
  • 20% higher preference for meat substitutes among adults aged 25-40
  • 10% price increase led to 5% demand drop (price elasticity)
  • 12% increase in repeat purchases through regional promotions
  • SKU rationalization recommendations for portfolio optimization
PythonPandasStatistical AnalysisHypothesis TestingRegression AnalysisData Visualization

Business Intelligence Dashboards @ Samsung SDS

Real-time BI dashboards connecting 16+ data sources for Samsung SDS, processing 1M+ records daily.

Problem

Fragmented data sources and manual reporting causing delays in business insights.

Solution

Built automated ETL pipelines and interactive dashboards with QlikView and Power BI for real-time analytics.

Impact
  • 30% faster reporting efficiency
  • 40% improved dashboard accessibility
  • 80% accuracy boost in KPI tracking
QlikViewPower BISQLETL PipelinesPythonData Warehousing

Journey

Experience

Professional and research milestones.

Research Collaborator

·University Research Project·Oct 2025 — Present
  • Codified 13+ modeling assumptions and an evaluation plan (cost/coverage/fairness KPIs).
  • Built defensible public-data acquisition workflow with 95%+ variable-dictionary completeness.
  • Translated 16 policy items into 5 ILP/MIP constraint families; synthesized 20+ papers.
OptimizationMathematical ModelingResearch

Business Intelligence Developer

·Samsung SDS·Jul 2022 — Nov 2023
  • Integrated 16+ data sources into unified real-time dashboards (+30% reporting efficiency).
  • Designed ETL pipelines processing 1M+ records daily with 99.5% accuracy.
  • Led migration to QlikView platform: 80% accuracy boost, 40% better accessibility.
  • Automated workflows in Python/SQL, cutting manual effort by 50%.
QlikViewSQLETLPython

Engineer Intern

·Samsung SDS·Feb 2022 — Jun 2022
  • Earned full-time offer through top performance in technical assessments.
  • Solved 100+ DSA problems, +40% problem-solving speed (top 54% of cohort).
  • Collaborated with senior engineers on code reviews and scalable data systems.
PythonAlgorithmsProblem Solving

Published on LinkedIn

Newsletter & Insights

Sharing knowledge on data science, analytics, and business intelligence.

Professional Insights Featured

By Mohan Venkata Pavan Sai Teja Kattiboyina

Professional Development Dec 2023· 4 min read

Creativity Is Valuable — But Execution Is Essential

Do's and Don'ts in a Professional Setting — Part 1. While creativity starts the conversation, it's execution that delivers results. An idea has value only when it's executable.

Professional Insights Featured

By Mohan Venkata Pavan Sai Teja Kattiboyina

Career Insights Dec 2023· 5 min read

Do We Really Need to Post on LinkedIn to Grow?

A reflection on visibility, authenticity, and professional growth. Skills build your foundation. Authenticity gives them a voice.

Stay Updated with More Insights

Follow for regular updates on data science, analytics, and professional development.

View All Posts on LinkedIn

Contact

Get In Touch

Open to full-time Healthcare Research and Biostatistician roles in healthcare scheduling, informatics, and clinical analytics.

MK

Mohan Venkata Pavan Sai Teja Kattiboyina

MS Business Analytics & AI · Healthcare Research

Book Call
Available for full-time Healthcare Research or Biostatistician roles

New Message

This will open your default email client.