About me

I am a third-year PhD student in CS at the University of Wisconsin-Madison, advised by Professor Frederic Sala. My research focus is on large language models and foundation models; I am particularly interested in i) how to improve their performance,particularly via data selection and curation and ii) how to evaluate them.

I am actively seeking a Summer 2026 internship in related fields, please feel free to reach out to me at zhiqi [at] cs [dot] wisc [dot] edu

Publications

Conference Publications

Journal Publications

Workshop Publications

Industry Experience

Roblox Corporation

Software Engineer Intern, AI/ML Team May 2023 -- Aug. 2023

  • Designed, developed, and deployed a full-stack project with a Slack Bot that integrates Vector Database & Large Language Models (LLMs) which can perform complex Q&A based on custom knowledge by Retrieval-Augmented Generation (RAG), resulting in a better solution that outperformed the existing Question Answering Slack Bot within the company.
  • Created an efficient data pipeline, ingesting diverse documents (Confluence, Stack Overflow, GitHub) and generating vector embeddings for rapid retrieval.

Teaching Experience

Comp Sci 540 — Introduction to Artificial Intelligence

Fall 2024, Spring 2025

Comp Sci 300 — Programming II

Fall 2023, Spring 2024

Services

Served as a reviewer for NeurIPS 2024, 2025, ES-FoMo@ICML2024

Undergraduate Projects

Tessellations on the Poincaré Half-Plane and Disk

Advisor: Professor Andrew Zimmer.

  • Contributed to the “Tessellations on the Poincaré Half-Plane and Disk” project in the Summer 2022 UW-Madison Research Experiences for Undergraduates (REU) in Analysis funded by the National Science Foundation (NSF). Developed a visualization tool to demonstrate principles of hyperbolic geometry for education purposes, allowing users to generate and explore tessellations on the Poincaré disk and half-plane, aiding students in comprehending complex concepts.
    [Poster PDF]

Random Walks on Groups

Advisor: Dr. Nate Fisher

  • Participated in a group project at Madison Experimental Mathematics Lab. Implemented Mathematica simulations to investigate the asymptotic properties of random walks on algebraic structures like $\mathbb{Z}^n$ and the Heisenberg group, quantifying metrics and analyzing their long-term pattern, such as expected travel distance, expectation of hitting time, and distribution of hitting location.
    [Poster PDF]