"Page Not Found", published in
Sitemap
A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.
Pages
"Thinking about data systems", published in
"Archive Layout with Content", published in
"Posts by Category", published in
"Posts by Collection", published in
"CV", published in
"CV", published in
"Markdown", published in
"Page not in menu", published in
"Page Archive", published in
"Portfolio", published in
"Publications", published in
"Sitemap", published in
"Posts by Tags", published in
"Talk map", published in
"Talks and presentations", published in
"Teaching", published in
"Terms and Privacy Policy", published in
"Blog posts", published in
"Jupyter notebook markdown generator", published in
Posts
"Future Blog Post ", published in 2199
"Blog Post number 4 ", published in 2015
"Blog Post number 3 ", published in 2014
"Blog Post number 2 ", published in 2013
"Blog Post number 1 ", published in 2012
portfolio
"Portfolio item number 1 ", published in
"Portfolio item number 2 ", published in
publications
"BitTorrent Network Traffic Analysis for Peer Link Prediction ", published in
"Game bot detection approach based on behavior analysis and consideration of various play styles ", published in
"Personalized Expert-Based Recommender System: Training C-SVM for Personalized Expert Identification. ", published in
"Sentiment Analysis Using News Comments for Public Opinion Mining ", published in
"TV program recommendation method using LDA clustering ", published in
"Semi-supervised learning for sentiment analysis in mass social media ", published in
"A Behavior Analysis-Based Game Bot Detection Approach Considering Various Play Styles. ", published in
"Towards interactive data exploration ", published in
"Using RDMA for Lock Management. ", published in
"A Data Quality Metric (DQM): How to Estimate The Number of Undetected Errors in Data Sets. ", published in VLDB
"Estimating the Impact of Unknown Unknowns on Aggregate Query Results. ", published in SIGMOD
"A Data Quality Metric (DQM): How to Estimate the Number of Undetected Errors in Data Sets. ", published in
"Towards Interactive Data Exploration. ", published in
"Unknown examples & machine learning model generalization ", published in arxiv 2018
"Estimating the Impact of Unknown Unknowns on Aggregate Query Results. ", published in TODS
"Towards Quantifying Uncertainty in Data Analysis & Exploration ", published in IEEE Bulletin (Data Engineering) 2018
"Improved Neighborhood Search for Collaborative Filtering. ", published in IJFIS
"Slice Finder: Automated Data Slicing for Model Validation. ", published in ICDE
"Towards Interactive Curation & Automatic Tuning of ML Pipelines. ", published in MLSys
"Towards Quantifying Uncertainty in Data Analysis & Exploration. ", published in IEEE Data Engineering Bulletin
"Unknown Examples & Machine Learning Model Generalization. ", published in arxiv
"Democratizing Data Science through Interactive Curation of ML Pipelines. ", published in SIGMOD
"Quantifying Uncertainty in Data Exploration. ", published in Brown University
"Slice Finder: Automated Data Slicing for Model Validation. ", published in ICDE
"Automated Data Slicing for Model Validation: A Big Data - AI Integration Approach. ", published in TKDE
"PyTorch/XLA SPMD: Scale Up Model Training and Serving with Automatic Parallelization ", published in PyTorch Blog 2023
"High-Performance Llama 2 Training and Inference with PyTorch/XLA on Cloud TPUs ", published in PyTorch Blog 2023
"CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL. ", published in
"CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL. ", published in ICLR
"High-Fidelity And Complex Test Data Generation For Real-World SQL Code Generation Services. ", published in arxiv
"Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL. ", published in VLDB
"LLMs and Databases: A Synergistic Approach to Data Utilization. ", published in IEEE Data Engineering Bulletin
"Multi-Objective Agentic Rewrites for Unstructured Data Processing. ", published in
"100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models ", published in
"Fine-Grained Table Retrieval Through the Lens of Complex Queries. ", published in
"PRISM: Navigating Cost–Accuracy Trade-offs for NL2SQL ", published in
talks
"Identify trends with the Power and Performance API ", published in Apple WWDC 2020
"Large-Scale Distributed Training with Dynamo and PyTorch/XLA SPMD ", published in PyTorch Conference 2023
"Auto-scaling LLM applications and workloads ", published in Aju University 2024
"PyTorch/XLA Auto-Sharding ", published in PyTorch Conference 2024
teaching
"Teaching experience 1 ", published in University 1, Department 2014
"Teaching experience 2 ", published in University 1, Department 2015