Sitemap
A list of all the posts and pages on the site. For you robots out there, an XML version is also available to digest.
Pages
"Thinking about data systems"
"Archive Layout with Content"
"Posts by Category"
"Posts by Collection"
"CV"
"Markdown"
"Page not in menu"
"Page Archive"
"Portfolio"
"Publications"
"Sitemap"
"Posts by Tags"
"Talk map"
"Talks and presentations"
"Teaching"
"Terms and Privacy Policy"
"Blog posts"
"Jupyter notebook markdown generator"
Posts
"Future Blog Post", published in 2199
"Blog Post number 4", published in 2015
"Blog Post number 3", published in 2014
"Blog Post number 2", published in 2013
"Blog Post number 1", published in 2012
Portfolio
"Portfolio item number 1"
"Portfolio item number 2"
Publications
"Game bot detection approach based on behavior analysis and consideration of various play styles", published in ETRI Journal 2013
"A Data Quality Metric (DQM): How to Estimate The Number of Undetected Errors in Data Sets", published in VLDB 2017
"Unknown examples & machine learning model generalization", published in arXiv 2018
"Towards Quantifying Uncertainty in Data Analysis & Exploration", published in IEEE Bulletin (Data Engineering) 2018
"Estimating the impact of unknown unknowns on aggregate query results", published in SIGMOD 2016 / TODS 2018 [Extended]
"Slice finder: Automated data slicing for model validation", published in ICDE 2019
"Democratizing data science through interactive curation of ML pipelines", published in SIGMOD 2019
"PyTorch/XLA SPMD: Scale Up Model Training and Serving with Automatic Parallelization", published in PyTorch Blog 2023
"High-Performance Llama 2 Training and Inference with PyTorch/XLA on Cloud TPUs", published in PyTorch Blog 2023
"CHASE-SQL: Multi-path reasoning and preference optimized candidate selection in text-to-SQL", published in ICLR 2025
"Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL", published in VLDB 2025
Talks
"Identify trends with the Power and Performance API", published in Apple WWDC 2020
"Large-Scale Distributed Training with Dynamo and PyTorch/XLA SPMD", published in PyTorch Conference 2023
"Auto-scaling LLM applications and workloads", published in Aju University 2024
"PyTorch/XLA Auto-Sharding", published in PyTorch Conference 2024
Teaching
"Teaching experience 1", published in University 1, Department 2014
"Teaching experience 2", published in University 1, Department 2015