Hi,
I’m Howard
Computational Data Science

Email Me

About

I’m Howard

CMU MCDS student and software/ML engineer focused on scalable ML systems, HPC, and LLM serving. Previously at SDSC, Compex Systems Singapore, and NXP, I’ve built distributed training/orchestration on SLURM/K8s, production RAG systems, and deployed models on AWS and Azure.

Education & Experience

Education

2025 - 2027

Carnegie Mellon University

School of Computer Science

Master of Science in Computational Data Science

Pittsburgh, PA

CMU
2021 - 2025

University of California, San Diego

Halıcıoğlu Data Science Institute

Bachelor of Science in Data Science

San Diego, CA

UCSD

Experience

June 2024 - Present

San Diego Supercomputer Center

Software Engineer Intern - Cyberinfrastructure Enabled ML

San Diego, CA

  • Developed bash and SLURM scripts to orchestrate distributed workloads for PyTorch, Spark, and Dask
  • Automated YAML generation for K8s deployments for dynamic resource allocation based on resource usage
  • Developed a pipeline for Docker to Singularity container conversion for CUDA dependent workloads
  • Deployed GraphRAG on a HPC cluster, configured scalable API endpoints to handle 100,000s of LLM requests
SDSC
June 2023 - September 2023

Compex Systems Singapore

Software Engineer Intern - LLM Innovation

Singapore

  • Implemented an internal RAG for support troubleshooting, adopted by 80% of support staff
  • Setup schema design and management of PostgreSQL vector store with on-demand ETL injection of new data
  • Served LLMs on Azure OpenAI with secure access controls and best practices for managing secrets/keys
  • Fine-tuned recommendation systems on Salesforce ticketing resulting in 2 hours saved per data-sheet search
Compex Systems
June 2022 - September 2022

NXP Semiconductors

Machine Learning Engineer Intern - Power Reduction

San Diego, CA

  • Deployed machine learning models on AWS Sagemaker and leveraged S3 for data retrieval and storage
  • Utilized supervised learning techniques and unsupervised dimensionality reduction with scikit-learn and TensorFlow
  • Developed an ETL pipeline to read and store data logs, increasing data collection speeds by 30%
  • Facilitated efficient job distribution and simulations on Ansys PowerArtist and IBM LSF with shell/SLURM scripts
NXP

Skills

Profesional Skills

Lorem ipsum dolor sit, amet consectetur adipisicing elit. Velit optio id vero amet, alias architecto consectetur error eum eaque sit.

Python
95%
JavaScript
85%
PyTorch
65%
AWS
85%

Projects

Contact