Posts by Collection
portfolio
"Portfolio item number 1"
"Portfolio item number 2"
publications
"Game bot detection approach based on behavior analysis and consideration of various play styles", published in ETRI Journal 2013
"A Data Quality Metric (DQM): How to Estimate The Number of Undetected Errors in Data Sets", published in VLDB 2017
"Unknown examples & machine learning model generalization", published in arXiv 2018
"Towards Quantifying Uncertainty in Data Analysis & Exploration", published in IEEE Data Eng. Bull. 2018
"Estimating the impact of unknown unknowns on aggregate query results", published in SIGMOD 2016 / TODS 2018 [Extended]
"Slice Finder: Automated data slicing for model validation", published in ICDE 2019
"Democratizing data science through interactive curation of ML pipelines", published in SIGMOD 2019
"PyTorch/XLA SPMD: Scale Up Model Training and Serving with Automatic Parallelization", published in PyTorch Blog 2023
"High-Performance Llama 2 Training and Inference with PyTorch/XLA on Cloud TPUs", published in PyTorch Blog 2023
"CHASE-SQL: Multi-path reasoning and preference optimized candidate selection in Text-to-SQL", published in ICLR 2025
"LLMs and Databases: A Synergistic Approach to Data Utilization", published in IEEE Data Eng. Bull. 2025
"Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL", published in VLDB 2025
talks
"Identify trends with the Power and Performance API", published in Apple WWDC 2020
"Large-Scale Distributed Training with Dynamo and PyTorch/XLA SPMD", published in PyTorch Conference 2023
"Auto-scaling LLM applications and workloads", published in Aju University 2024
"PyTorch/XLA Auto-Sharding", published in PyTorch Conference 2024
teaching
"Teaching experience 1", published in University 1, Department 2014
"Teaching experience 2", published in University 1, Department 2015