Skip to content
View Samantha-A-Taylor's full-sized avatar

Block or report Samantha-A-Taylor

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Samantha-A-Taylor/README.md

About Me 👋


Welcome to my page! I am @Samantha-A-Taylor, a data enthusiast with a background in bioinformatics and experience in Python, R, SQL, and data visualization. I have a strong foundation in exploratory data analysis, statistical modeling, and database management, and I enjoy turning complex datasets into clear, meaningful results.

After a few years teaching, I have been focusing on refining my data analytics skills through projects in sales analysis, fraud detection, education trends, and other applied analytics. This page highlights my work and the projects I have completed while preparing to bring my skills to data-driven roles in business and research.

Top Skills 💡

  ✓ Data analysis and visualization (Python: Pandas, Matplotlib, Seaborn; R)  
  ✓ SQL and relational database management  
  ✓ Microsoft Excel (PivotTables, PivotCharts, formulas, and functions)
  ✓ Dashboards and interactive reporting (Tableau and Excel)
  ✓ Exploratory data analysis and statistical modeling  
  ✓ Data cleaning, processing, and pipeline automation  

Projects 📂


Professional Projects 💻

Data-Driven Insights into Medication Errors Across Healthcare Settings 💊
Independent Project     |     Remote

    Analyzed real-world medication safety incident data from the National Reporting and Learning System 
    (NRLS) in England and Wales (2016–2021) using Python, Pandas, SQL/SQLite, Seaborn, Matplotlib, 
    and Squarify. Conducted end-to-end analysis to evaluate medication error patterns, reported harm,
    guideline-related contributing factors, workflow vulnerabilities, and spatial–temporal variation.
    Generated actionable insights to support patient safety, quality improvement, and targeted risk 
    reduction across healthcare settings.  

Diabetic Patient Readmission Trends & Dashboard: Identifying High-Risk Patients, Departments, and Clinical Drivers 🩸
Independent Project     |     Remote

  Analyzed 10 years of U.S. hospital diabetic patient data (1999–2008) using Excel pivot tables, charts, 
  and interactive dashboards. Conducted end-to-end analysis of readmission trends, patient demographics, 
  departmental performance, and clinical complexity drivers. Engineered readmission_flag metrics, visualized 
  risk patterns by specialty, age group, and number of diagnoses, and synthesized actionable insights to 
  support hospital decision-making, targeted interventions, and care coordination strategies.

Sales Performance & Revenue Analysis KPI Trends, Product Insights, and Regional Performance 📈
Independent Project     |     Remote

  Analyzed a synthetic, transaction-level retail sales dataset spanning 2022–2024 using Python, Pandas, 
  SQL/SQLite, Seaborn, and Matplotlib. Conducted end-to-end EDA to evaluate revenue and profit KPIs, 
  product category performance, regional trends, discount impacts, and customer purchasing behavior, 
  generating actionable insights into sales drivers, operational efficiency, and business performance 
  across time and geography.

Multi-Factor Transaction Monitoring and Fraud Risk Analysis 🫆
Independent Project     |     Remote

  Analyzed a synthetic financial transaction dataset using SQL/SQLite and Tableau. Conducted end-to-end 
  exploratory data analysis (EDA) to examine transaction amounts, frequency patterns, and login behavior. 
  Engineered multi-factor fraud metrics to flag high-risk transactions, scored accounts by combined 
  anomalies, and visualized behavioral and geographic patterns to generate actionable insights for 
  proactive fraud detection and transaction monitoring.  

Student Performance Analytics and Trend Analysis | March Madness Reading Challenge 📚
Cabarrus County Schools     |     Concord, North Carolina

  Explored K–5 student reading log data from the March Madness Reading Challenge using Python, Pandas, 
  Seaborn, and Matplotlib.pyplot. Conducted EDA to track weekly reading trends, identify top performers 
  and at-risk students, and analyze class-level distributions and outliers to generate actionable insights.

Quantitative Analysis of Socioeconomic Indicators, Crime, and Educational Outcomes in Chicago Communities 🔎
Independent Project     |     Remote

  Conducted community-level analysis of Chicago census, public school, and crime datasets using Python, Pandas, 
  Seaborn, Matplotlib, and SQLite3. Performed EDA to explore distributions, temporal and spatial trends, and 
  correlations across socioeconomic, crime, and educational metrics, generating actionable insights for 
  policymakers and educators.

COVID-19 Case Trends and State-Level Analysis in the United States | Summer 2020 Dynamics 🦠
UNC Charlotte's College of Computing and Informatics     |     Charlotte, North Carolina

   Explored US COVID-19 confirmed case data from March through September 2020 using Python, Pandas, 
   and Matplotlib. Conducted EDA to track state-level trends, calculate numerical and percentage increases, 
   compute weekly average case growth, and analyze proportions of national cases, generating actionable 
   insights into pandemic dynamics across states and territories. 

Research & Academic Projects 🔬

Misc. Bioinformatics Micro-Projects 🧬
UNC Charlotte's College of Computing and Informatics     |     Charlotte, North Carolina

 Graduate-level bioinformatics and phylogenetics research projects involving Bayesian and maximum likelihood 
 inference, coalescent species tree estimation, ancestral state reconstruction, molecular clock dating, gene 
 flow testing, and genetic distance modeling using R, R Markdown, RAxML, MrBayes, ASTRAL, phytools, phangorn, 
 MUSCLE, Clustal-Omega, and HPC-based workflows.

Local Adaptation and Range Expansion in Culex tarsalis: Comparative Genomics and Population-Level Analysis​ 🦟
North Carolina Research Campus     |     Kannapolis, North Carolina

 Investigated the genomic basis of local adaptation and potential range expansion in Culex tarsalis, a primary 
 vector of West Nile virus, using reference genome analysis, orthogroup identification, gene duplication events, 
 protein function annotation, and population-level sequencing data. Conducted quality control and alignment of 
 884 Illumina paired-end samples, evaluated mapping efficiency, and explored alternative alignment strategies. 
 Performed comparative genomics to identify species-specific gene duplications, linked orthogroups to Gene 
 Ontology terms, and visualized functional patterns through frequency tables and word clouds, generating 
 actionable insights into environmental and genetic factors driving divergence among Pacific, Sonoran, and 
 Midwest mosquito populations to inform disease management and anticipate geographic spread.

Creative Projects ✨

Mass Effect 3: War Asset Distribution & Strategic Analysis image


Independent Project     |     Remote

  Analyzed war asset distribution across the Milky Way in Mass Effect 3 using Python, Pandas, Seaborn, 
  Matplotlib, and SQLite. Hand-curated dataset from a 100+ hour playthrough. Conducted exploratory data 
  analysis to quantify assets at region, cluster, and star system levels, modeled optimal visitation 
  sequences, visualized galaxy hierarchy with radial and cluster-focused plots, and generated actionable 
  insights for strategic resource collection and efficient gameplay.

Education 🎓


Master of Science in Bioinformatics and Genomics       |       The University of North Carolina at Charlotte

  ✓ Programming [R, Python, SQL]  
  ✓ Scripting [Bash, UNIX/Linux Shell]  
  ✓ Statistical Analysis [Pandas, NumPy, Seaborn, NumPy, Matplotlib, TopGO, etc.]  
  ✓ Database Management & Modeling  
  ✓ Data Processing  
  ✓ Algorithm Development  
  ✓ Sequence Alignment  
  ✓ Molecular Data Analysis  
  ✓ Phylogenetics  
  ✓ Machine Learning

Databases and SQL for Data Science with Python by IBM Course Certification       |       IBM & Coursera

  ✓ Data Analysis Within a Database [SQL & Python]  
  ✓ Developing Basic to Intermediate SQL Queries for Data Manipulation Using DML Commands  
  ✓ Building and Structuring Relational Databases, Creating and Modifying Tables via DDL  
  ✓ Construct Complex Queries Leveraging Advanced SQL Features Including Views, Transactions, Stored Procedures, and Joins

Bachelor of Arts in Biology       |       The University of North Carolina at Charlotte

   Minors: Statistics and Bioinformatics

  ✓ Foundational and Advanced Statistical Methods [Probability Models, Inference Techniques, Regression Analysis, and Experimental Design]  
  ✓ Programming [R, Python, SQL]  
  ✓ Scripting [Bash, UNIX/Linux Shell]

Popular repositories Loading

  1. COVID-19CaseTrendsAndState-LevelAnalysisInTheUS COVID-19CaseTrendsAndState-LevelAnalysisInTheUS Public

    A quick project serving as an introduction to data frames and data manipulation using Pandas.

    Jupyter Notebook 1

  2. QuantitativeAnalysisOfSocioeconomicIndicatorsCrimeAndEducationalOutcomesInChicagoCommunities QuantitativeAnalysisOfSocioeconomicIndicatorsCrimeAndEducationalOutcomesInChicagoCommunities Public

    Community-focused analytics of Chicago’s census, crime, and school datasets. Combines data cleaning, visualization, and cross-dataset correlation analysis to explore how socioeconomic factors relat…

    HTML 1

  3. Culex-tarsalis Culex-tarsalis Public

    Culex tarsalis is a North American mosquito species known for being the principle vector of the West Nile virus along with other arboviruses. The purpose of this project is to examine local adaptat…

    HTML

  4. Samantha-A-Taylor Samantha-A-Taylor Public

    Config files for my GitHub profile.

  5. StudentPerformanceAnalyticsAndTrendAnalysis-MarchMadnessReadingChallenge2025 StudentPerformanceAnalyticsAndTrendAnalysis-MarchMadnessReadingChallenge2025 Public

    Analyze K–5 reading logs during March Madness Reading Challenge: track trends, top performers, participation, and outliers to support instruction and highlight student progress.

    HTML

  6. MiscBinfMicroProjects MiscBinfMicroProjects Public

    Miscellaneous micro-projects from my Master’s degree focused on phylogenetics and evolution: Bayesian and maximum likelihood trees, concatenated vs coalescent analyses, ancestral state reconstructi…