Ilsun Yun Ph.D. Candidate

Computational Biologist | Machine Learning for Cancer Precision Medicine

0 Research Projects
0 Publications
0 Years Experience

Scroll to explore

About Me

Computational biologist specializing in machine learning for cancer precision medicine with a strong computer science foundation. I develop end-to-end ML pipelines that transform multi-omics data into clinically actionable insights.

🤖

Machine Learning

Ensemble methods (Random Forest, XGBoost, AutoGluon), hyperparameter optimization, SHAP interpretation, cross-platform validation

⚙️

Pipeline Engineering

Scalable automated workflows for NGS data processing (WGS, RNA-seq, methylation-seq, 16S rRNA-seq)

🔬

Translational Impact

First-author publications in clinical oncology, validated biomarkers from computational discovery to wet-lab validation

Education

2020 - 2026 (Expected)

Ph.D. in Biomedical Engineering

Ulsan National Institute of Science and Technology (UNIST)

Thesis: Machine Learning Approaches for Cancer Genomics and Microbiome-Based Prediction

Advisor: Prof. Semin Lee

Jan 2019 - Aug 2019

Exchange Student

McGill University, Montreal, Canada

Kyungdong Scholarship recipient (10,000,000 KRW)

Machine Learning Intern at Data4Good (non-profit organization)

Supervisor: Prof. Janosch Ortmann, Université de Montréal

2013 - 2020

B.S. in Biomedical Engineering & Computer Science

Ulsan National Institute of Science and Technology (UNIST)

Double Major | GPA: 3.78/4.3

Key Research Projects

Click on a project to learn more

Published

Multi-omics Characterization of Pediatric Brain Tumors

Acta Neuropathologica Communications (2024)

Developed ensemble ML classifier (F1=0.89) to distinguish somatic vs. germline mutations, enabling genetic analysis without matched normal samples.

Ensemble Learning Feature Engineering Clonality Analysis
Under Review

Predictive Modeling for Colorectal Cancer Subtype Classification

First Author | 109 validation + 1,308 training samples

Built random forest classifiers achieving AUC=0.931 (TCGA) and Macro-AUC=0.857, creating cost-effective qPCR-based CMS4 classifier.

Random Forest SHAP Analysis Cross-platform Validation
In Preparation

Bile Microbiome as Prognostic Biomarker

Co-first Author | 143 patients

Built complete automated 16S analysis pipeline identifying 3 prognostic microbiome groups with significantly different survival outcomes (HR=3.2, p=0.00009).

QIIME2 Unsupervised Clustering Survival Analysis
In Preparation

SChLAP1:UBE2E3 Fusion Discovery in Korean Prostate Cancer

Co-first Author | 114 patients

Developed automated fusion calling pipeline identifying ethnicity-specific fusion as independent predictor of biochemical recurrence (HR=2.86, p=0.02).

STAR-Fusion Cox Regression Precision Medicine

Publications

Published

Comprehensive multiomics analysis reveals distinct differences between pediatric choroid plexus papilloma and carcinoma

Choi, Y.*, Choi, S.A.*, Koh, E.J.*, Yun, I.*, Lee, S. & Kim, S.K.

Acta Neuropathologica Communications (2024), 12, 93

DOI: 10.1186/s40478-024-01814-y
Under Review

Clinical Implications of Platelet-derived Growth Factor Receptor A & B in Colorectal Cancer and Developing a Predictive Model for Consensus Molecular Subtype 4

Yun I., Lee, S.M., Lee, K.Y.

First Author | Manuscript under review

In Preparation

SChLAP1:UBE2E3 Gene Fusion: A Novel Prognostic Biomarker in Korean Prostate Cancer

Kang B.*, Yun I.*, Lee S., Ha H.

Co-first Author

In Preparation

Comparative Analysis of Bile and Fecal Microbiome in Pancreatic & Biliary Cancer Patients

Yun I.*, Kim H.*, Lee S., Park J.

Co-first Author

Technical Skills

Machine Learning & Statistical Modeling

Random Forest XGBoost AutoGluon SVM SHAP Cross-validation Hyperparameter Optimization Ensemble Methods PCA t-SNE UMAP

Programming & Data Engineering

Python R pandas NumPy scikit-learn tidyverse ggplot2 Bash Scripting Git/GitHub Pipeline Automation

NGS Bioinformatics

GATK STAR BWA-MEM DESeq2 STAR-Fusion CNVkit VarDict Ensemble-VEP FastClone RSEM

Microbiome Analysis

QIIME2 DADA2 PICRUSt2 LEfSe ANCOM phyloseq microbiome vegan

Statistical Analysis & Visualization

Cox Regression Survival Analysis Kaplan-Meier GSEA ComplexHeatmap matplotlib seaborn plotly

Data Processing & Computation

Parallel Computing Batch Processing HPC Clusters VCF/BAM/FASTQ

Get In Touch

📧
📍

Location

UNIST, Ulsan, Republic of Korea

📱

Phone

+82-10-4207-9782

Interested in collaboration or have questions about my research?

Download Full CV