Health Data Science and Big Data Analytics AI Research Group

About Our Research Group

We adopt a systematic approach to understand the complex structure of health data. We develop clinical and administrative decision support systems by integrating data from electronic health records, population-based health statistics, biobanks, and national health systems.

While ensuring the systematic implementation of data-driven health solutions, we prioritize data security, privacy, and ethical use principles. The application of big data technologies in healthcare requires special approaches, particularly in terms of scalability and performance.

The group is developing algorithms that detect inconsistencies in electronic health records, meaningfully complete missing data, and ensure data integrity. This process also includes standardization and harmonization of data from different healthcare institutions.

Our Core Approach
                                    
Systematic analysis of large-volume health data

Data security and privacy-focused solutions

Multi-source data integration and harmonization

Time series analysis and prediction models

Our Application Areas

Core application areas we develop in health data science and big data analytics

Electronic Health Records

EHR analysis, clinical pattern extraction, and patient profiling

Epidemiological Analysis

Population-based health data modeling and disease burden analysis

Health Economics

Cost optimization and health expenditure analysis

Resource Planning

Hospital capacity and healthcare personnel optimization

Time Series Analysis

Seasonal disease trend and prevalence prediction models

Data Privacy

Anonymization and data security algorithms

Health Geography

Regional health status mapping and spatial analysis

Pharmacoepidemiology

Drug safety monitoring and supply chain optimization

Our Example Applications

Our big data analytics systems used and tested at the national level

94% Accuracy

National Diabetes Prevalence Prediction Model

Predicts diabetes prevalence across 81 provinces with 94% accuracy using E-Nabız data and supports health planning with 5-year projections.

E-Nabız Prevalence Province-Based

5-year projection and planning support

89% Accuracy

Hospital Capacity Prediction System

Analyzes Medula data, seasonal trends, and demographic factors to predict hospital admissions on a weekly basis with 89% accuracy.

Medula Seasonal Demographic

Optimizes resource planning

National Level

Antibiotic Resistance Monitoring Platform

Analyzes national e-prescription data to monitor antibiotic usage patterns and regionally predicts resistance development risk.

E-Prescription Resistance Regional

Monitors antibiotic usage patterns

Real-time

Pandemic Early Warning System

Analyzes symptom-based admission data in real-time to regionally assess epidemic risks and provide early warnings to health authorities.

Symptoms Real-time Early Warning

Provides early warnings to health authorities

Cost Analysis

Healthcare Expenditure Analysis System

Integrates patient admission data, treatment costs, and demographic factors to analyze the main dynamics of healthcare expenditures.

Cost Demographic Optimization

Provides cost optimization recommendations

Cancer Screening

Cancer Screening Program Analysis

Analyzes national cancer screening data to identify risk groups, evaluates screening effectiveness, and develops personalized screening recommendations.

Screening Risk Groups Personalized

Personalized screening recommendations

Our Data Flow Processes

The methodology we use in the processes from collection to analysis of health data

Data Collection and Integration

Data collection from national systems such as E-Nabız, Medula, and e-Prescription is a sensitive process. We use robust integration algorithms in the face of different formats, coding systems, and data quality issues.

Challenge: Ensuring consistency among millions of records and real-time integration

Data Security and Anonymization

We use differential privacy and homomorphic encryption techniques to protect patient privacy. We focus on conducting scientifically valuable analyses while protecting personal information.

Innovation: Secure data sharing with k-anonymity and l-diversity principles

Data Cleaning and Quality Control

Missing values, outliers, and systematic errors are critical issues in big data analysis. We improve data quality with machine learning-assisted data cleaning algorithms.

Value: Reliable analysis results by achieving 95% data quality

Statistical Analysis and Modeling

We reveal complex relationships in health data through time series analysis, survival analysis, and multivariate statistical methods. Bayesian approaches provide uncertainty quantification.

Criticality: Causal inference in epidemiological analyses

Machine Learning and Prediction

We develop prediction models from health data using deep learning, ensemble methods, and transfer learning. Overfitting and bias problems are addressed with specialized validation strategies.

Future: Inter-hospital collaboration through federated learning

Visualization and Reporting

We transform complex analysis results into comprehensible dashboards and interactive visualizations for decision-makers. We provide continuous monitoring with real-time monitoring systems.

Application: Strategic decision support at the ministerial level

Our Stakeholders

Expert fields we collaborate with through a multidisciplinary approach in health data science

Health Stakeholders

Public Health

Biostatistics

Epidemiology

Health Management

Medical Informatics

Health Economics

Engineering Stakeholders

Data Science

Computer Eng.

Statistics

AI Engineering

Big Data

Cloud Computing

Research and Development Focus Areas

Core research topics we focus on with a comprehensive and systematic approach in health data science

Data Structure and Quality Analysis

Health data has high-dimensional, sparse, heterogeneous, and time-dependent characteristics. We develop algorithms that detect missing values, measurement errors, and systematic biases.

Challenge: Standardization and harmonization of data from different institutions

Time Series and Trend Analysis

Seasonal variations in disease prevalence, epidemic waves, and long-term trends are modeled with temporal analysis techniques. We develop hybrid prediction models.

Innovation: Hybrid models supporting short and long-term projections

Differential Privacy

We research differential privacy, homomorphic encryption, and secure multi-party computation techniques that enable scientific research while protecting patient identities.

Value: Ethical data sharing with k-anonymity and l-diversity

Multi-Modal Data Integration

We provide comprehensive analysis capabilities by combining structured, semi-structured, and unstructured health data. Laboratory, clinical notes, and imaging data are integrated.

Criticality: Establishing meaningful relationships across different data types

Transfer Learning and Adaptation

We provide effective solutions for rare diseases and small patient groups by adapting models trained on large datasets to small and specific datasets.

Future: Inter-hospital model sharing with federated learning

Standardization and Interoperability

We support national and international data sharing by developing data models and transformation algorithms compatible with international standards like FHIR, HL7, and ICD-10.

Application: Interoperability of national health information systems

Our Impact and Achievements

Concrete achievements in health data science and the impact we create at the national level

12+

Active Projects

In national systems

25+

Researchers

Multidisciplinary team

50M+

Patient Records

Analysis capacity

95%

Data Quality

Average level

Collaboration Opportunities

Would you like to collaborate with us to develop big data analytics solutions in health data science?

Public Institutions

Strategic collaborations with Ministry of Health and national data systems

Academic Projects

Inter-university big data research projects and doctoral programs

Technology Partnerships

Solution development with cloud computing and big data technology companies

Get in Touch Other Research Groups