L. Mark Coty

AWS‑certified Data Engineer and Data Scientist turning data into actionable business insightsSeeking full-time, contract, and remote positions.


  • I have the AWS Data Engineering Associate certificate: DEA-C01

  • My resume may be found here, my GitHub is here, and you may email me at [email protected].

  • In Programming & Data Analysis I have experience and certifications in Python, SQL, Pandas, NumPy, SciPy, Scikit-Learn, Matplotlib, Seaborn, etc.

  • In Machine Learning & Statistical Methods I am fluent in Regression, Classification, A/B Testing, Hypothesis Testing, Bayesian Inference

  • In Data Engineering & Visualization I have used Tableau, AWS QuickSight, and processes for Data Cleaning and Feature Engineering

  • In Tools & Workflow, I've worked with Jupyter Notebooks, Git, GitHub, Jira, Google Suite, Microsoft Suite

  • I have 25+ years experience teaching mathematics at the college level.

  • I have managed the math and ELA teams for a major US test-prep company.

  • Languages: English (Fluent), German (Fluent Translation Skills)


Projects and Products


1. MHA Tracker
The AWS-based MHA Tracker gives pastors and ministry leaders a simple, secure way to track housing expenses and maintain IRS-compliant records. You can spend less time buried in receipts and more time focused on ministry. Everything you need for tax season, right at your fingertips.

Website: MHA Tracker
GitHub: MHA Tracker


2. KaiMate Food Ordering App
An AWS-built app connecting food lovers with local eateries, while strengthening communities through shared values of manaakitanga, whanaungatanga, kotahitanga, kaitiakitanga, and tautoko.

Website: KaiMate
GitHub: KaiMate


3. AWS Project: NYC Taxi Lakehouse
Designed and implemented a serverless data lakehouse on AWS to process NYC TLC Yellow Taxi trip data (3.3M+ rows/month), using Amazon S3 (Bronze/Silver/Gold), AWS Glue (PySpark ETL), Glue Data Catalog, and Amazon Athena. Built partitioned Parquet fact and dimension tables, implemented dimensional joins and CTAS-based analytics marts, and published monthly data quality metrics (referential integrity, row counts, trip duration and fare statistics) to validate pipeline health and query performance.

Website: NYC Taxi
GitHub: NYC Taxi


4. The Ugly Professor? Professor evaluations vs physical appearance
Data from student evaluations for 463 courses taught by 94 professors from the University of Texas at Austin. Also, six students rate the professors' physical appearance.

Website: Ugly Professor?
GitHub: Ugly Professor?


5. Trends & Explicitness in Spotify Songs (EDA + Modeling)
An analysis of a 2024 list of 237K Spotify Songs

Website: Spotify Songs
GitHub: Spotify Songs


6. AWS Weather API Project
An AWS Pipeline from OpenWeather API to Visualizations

Website: API Weather
GitHub: API Weather


7. Heart Attack Risk Prediction Model
Developed a predictive model using logistic regression and other machine learning techniques.

Website: Heart Attack
GitHub: Heart Attack


8. Data Science Job Postings with Salaries (2025)
Used scraped job postings to compare salaries based on numerous features and used hypothesis tests to compare job titles' offerings.

Website: Data Science Jobs
GitHub: Data Science Jobs


9. Telco Churn Prediction Model
Trained and fit a model to predict customer churn. Used an expanded version of the dataset.

Website: Telco Churn
GitHub: Telco Churn


10. Solar Panels Chingu Project
Simplified the process of scheduling solar panel evaluations for Los Angeles residents and city hall employees.

Website: Solar Plexus
GitHub: Solar Panels App