🖤 Hi! I'm Su. Sumin (수민) for those who want my real name.

I work on post-training, evaluation, and model behavior. I'm particularly interested in data attribution, data efficiency, and understanding why models succeed, fail, and generalize.

At Turing, I lead Terminal Bench, working on reward design, synthetic data generation, and feedback signals for post-training. Previously, I pre/post-trained domain-specific language models across enterprise and media applications.

Originally from Korea, with formative years in New York and now based in SF.

U.S. Citizen interested in defense tech 🇺🇸🇰🇷

Member of Technical Staff, Post-Training & Evals · Turing 2026–Present

Tech Lead, AI Research Scientist · Accenture 2023–2026

M.S. in Artificial Intelligence · Carnegie Mellon University 2021–2023

Data Scientist · Nearpod 2020–2021

Data Scientist · IBM 2017–2020

B.A. in Economics-Statistics & Linguistics · Columbia University 2013–2017

High School Diploma · Korean Minjok Leadership Academy 2010–2013

perpetually oscillating between an art ho and a tech bro

Research

SFT-GO: Supervised Fine-Tuning with Group Optimization for Large Language Models. Developed a token importance-aware post-training method for improving training signal utilization in large language models. arXiv, 2025.
Small Language Models: Architecture, Evolution, and the Future of Artificial Intelligence. Proposed a multi-axis taxonomy for classifying small language models and synthesized emerging approaches to capability-efficiency tradeoffs. Preprint, 2025.
Harnessing Business and Media Insights with Large Language Models. Pre- and post-trained a domain-specialized LLM for business intelligence and media analysis for Fortune Magazine. arXiv, 2024.
Model Probing and Capability Attribution. Developed a probing framework for identifying which latent linguistic signals drive model predictions, enabling capability attribution, systematic error analysis, and instance-level failure diagnosis. Report, 2022.
Behavioral Effects of Model Compression. Investigated how pruning and quantization alter learned representations and downstream model behavior, revealing compression-induced shifts in calibration, prediction dynamics, and output distributions across diverse task settings. Report, 2022.
Low-Resource Machine Translation. Combined target-side monolingual data augmentation with LangRank-guided transfer language selection, improving BLEU scores by up to 345% over baseline systems for Belarusian-English and Azerbaijani-English translation. Report, 2022.
Low-Resource Multilingual ASR. Investigated tokenization and self-supervised representation learning for low-resource speech recognition, evaluating HuBERT, wav2vec 2.0, language model integration, and Byte Pair Encoding vocabulary design in African-accented French. Report, 2022.
Temporal Action Localization. Investigated architectural approaches for long-range temporal reasoning in video understanding, developing extensions to Boundary-Matching Network that improved ActivityNet-1.3 localization performance by 0.9 AUC through temporal feature propagation and global context aggregation. Report, 2021.

Google Scholar

Projects

Driver/Restaurant Recommendation System

Developed an Uber-like Java program that matches real-time client requests to available cab drivers and targeted restaurant advertisements based on regression models by processing multiple streams of GPS, Apple Watch user health and biometric data, and Google Map & Yelp business data. Deployed Kafka and Samza on a YARN cluster provisioned on AWS.

Su Park

Research

Projects

Driver/Restaurant Recommendation System

Deep Learning Algorithms From Scratch

Twitter Analytics Web Service

Heterogeneous Storage for Social Networking

..other side quests