About Me

Hi, my name is Yin Lin (林茵 in Chinese). I am a senior algorithm engineer at Alibaba Group, where I explore AI-driven solutions for enhancing data analytics and management. Before joining Alibaba, I earned my Ph.D. from the University of Michigan, Ann Arbor, where I had the privilege of being advised by Dr. H. V. Jagadish. I was also a member of the Database Group. My thesis research focused on data equity systems, aiming to prevent the misuse and misinterpretation of big data.


Before my Ph.D. studies at UM, I obtained my bachelor's degree from Shanghai Jiao Tong University in 2019.



    What's new!
  • Our paper: Large Language Models as Pretrained Data Engineers: Techniques and Opportunities are accepted to IEEE Data Engineering Bulletin 2025!

Publications

1. Large Language Models as Pretrained Data Engineers: Techniques and Opportunities (pdf)


Yin Lin, Bolin Ding, Jingren Zhou

IEEE Data Engineering Bulletin 2025

2. Efficient Row-Level Lineage Leveraging Predicate Pushdown (pdf)


Yin Lin, Cong Yan

CoRR, 2024, Arxiv/2412.16864

3. Mitigating Subgroup Unfairness in Machine Learning Classifiers: A Data-Driven Approach (pdf)


Yin Lin, Samika Gupta, H. V. Jagadish

ICDE 2024

4. SMARTFEAT: Efficient Feature Construction through Feature-Level Foundation Model Interactions (pdf)


Yin Lin, Bolin Ding, H. V. Jagadish, Jingren Zhou

CIDR 2024

5. Predicate Pushdown for Data Science Pipelines. (pdf)


Cong Yan, Yin Lin, Yeye He.

SIGMOD 2023 (best paper award)

6. Representation Bias in Data: A Survey on Identification and Resolution Techniques. (pdf)


Nima Shahbazi, Yin Lin, Abolfazl Asudeh, H. V. Jagadish.

ACM Computing Surveys

7. OREO: Detection of Cherry-picked Generalizations. (pdf)


Yin Lin, Brit Youngmann, Yuval Moskovitch, H. V. Jagadish, Tova Milo

VLDB 2022 - Demo

8. On Detecting Cherry-picked Generalizations. (pdf)


Yin Lin, Brit Youngmann, Yuval Moskovitch, H. V. Jagadish, Tova Milo

VLDB 2022

9. Identifying Insufficient Data Coverage in Databases with Multiple Relations. (pdf)


Yin Lin, Yifan Guan, Abolfazl Asudeh, H. V. Jagadish

VLDB 2020

10. MithraDetective: A System for Cherry-picked Trendlines Detection. (pdf)


Yoko Nagafuchi, Yin Lin, Kaushal Mamgain, Abolfazl Asudeh, H. V. Jagadish, You (Will) Wu, Cong Yu

CoRR, 2020, Arxiv/2010.08807

11. On Structural vs. Proximity-based Temporal Node Embeddings. (pdf)


Puja Trivedi, Alican Büyükçakır, Yin Lin, Yinlong Qian, Di Jin, Danai Koutra

MLG@KDD 2020

12. R2-Tree: An Efficient Indexing Scheme for Server-Centric Data Center Networks. (pdf)


Yin Lin, Xinyi Chen, Xiaofeng , Guihai Chen

DEXA 2018


Education

Ph.D. student: Sept. 2019 - Dec. 2024

University of Michigan

Computer Science and Engineering (CSE)


Bachelor: Sept. 2015 - June 2019

Shanghai Jiao Tong University

Computer Science, School of Electronic Information and Electrical Engineering ( CS )

Experience

Reseach Intern: May 2023 - Aug. 2023

Alibaba Group, Data Analytics and Intelligence Lab (DAIL), Damo Academy


Reseach Intern: June 2022 - Aug. 2022

Microsoft Research, Data Management, Exploration and Mining (DMX)


Summer Intern: May 2018 - Jul. 2018

University of Waterloo, Software Architecture Group

Scholarships and Awards

Best Paper Award at SIGMOD 2023

NSF Travel Award for ICDE 2024

Rackham Dean’s and Named PhD fellowship

Outstanding Undergraduate in Shanghai Jiao Tong University

Chun Tsung Scholar from Shanghai Jiao Tong University

Program Committees

AAAI Workshop on Artificial Intelligence with Biased or Scarce Data (AIBSD), 2024

32nd ACM International Conference on Information and Knowledge Management (CIKM), 2023, 2024