Skip to content
View gksdusql94's full-sized avatar

Block or report gksdusql94

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
gksdusql94/README.md

Facts About Me

👋 Hi, I’m @Yeonbi Han, MD(DKM, Doctor ofr Korean Medicine) and Data Science Student in The City University of New York (CUNY) Graduate Center. As a data-driven healthcare professional, I leverage my background as a medical doctor and my expertise in data science to tackle healthcare challenges. My passion lies in extracting actionable insights from pharmaceutical and healthcare data to drive evidence-based decision-making, improving patient outcomes and operational efficiencies.

👀 I hold a bachelor’s degree in Medicine, as well as a bachelor’s in Computer Engineering and Artificial Intelligence. With 3 years of experience as a physician specializing in Oriental Medicine, my true passion lies in data and its ability to tell meaningful stories. I’m dedicated to bridging the gap between medicine and technology to create innovative healthcare solutions. I have worked as a Clinical Scientist Intern at Johnson & Johnson, where I analyzed medical data and built disease prediction models. Currently, I am a Data Science Intern at Rutgers, applying Natural Language Processing to healthcare data.

What Can I Offer

🌱 My journey includes developing machine learning models to predict patient outcomes, analyzing clinical trial data, and collaborating on research to enhance healthcare systems' efficiency. My expertise spans across:

  • Probability and Statistics: SPSS, SAS Programming
  • Natural Language Processing (NLP)
  • Big Data Analysis & Algorithms: Python (Scikit-learn, TensorFlow, PyTorch, NLTK), R
  • Data Visualization: Python (Matplotlib, Seaborn), Tableau, R, SAS
  • Machine Learning & AI: Linear Regression, Logistic Regression, Support Vector Machine (SVM), k-Nearest Neighbors (kNN), Decision Tree, Ensembles, Recurrent Neural Networks (RNN), Convolutional Neural Networks (CNN)

Reach Out to Me

📫 I am actively looking for 2025 full-time positions in Healthcare Data Scientist and welcome connections via email or LinkedIn:

Pinned Loading

  1. Cluster_BreastCancer Cluster_BreastCancer Public

    This project demonstrates the use of K-Means clustering and Singular Value Decomposition (SVD) for analyzing the Breast Cancer Wisconsin dataset using Apache Spark, focusing on clustering performan…

    Jupyter Notebook 1

  2. ML_Hospital ML_Hospital Public

    This project aims to predict hospital readmissions in diabetic patients using machine TF-IDF.

    Jupyter Notebook 1

  3. ML_Voting ML_Voting Public

    This project models U.S. presidential precinct voting patterns in Washington, focusing on income-based voter behavior by using ML like Random Forest, SVR, and deep L, with Random Forest.

    Jupyter Notebook

  4. ML_House ML_House Public

    This project uses the K-Nearest Neighbors (KNN) algorithm to predict home prices in Ames, Iowa, based on historical data, focusing on key features like house size, quality, and area

    Jupyter Notebook 1

  5. ML_Wine ML_Wine Public

    This project predicts wine quality (good or bad) using KNN and Logistic Regression. KNN with Manhattan distance and IDW achieved the best F1 score of 0.87, outperforming Logistic Regression.

    Jupyter Notebook 1

  6. Dementia_NLP Dementia_NLP Public

    This project analyzes sentence construction patterns in dementia patients using NLP techniques.

    Jupyter Notebook 1