Lily Ge

~ se @ uwaterloo · ir & nlp ~

Hello! I'm Lily Ge; I also publish and collaborate under my given name, Yijun Ge. I'm a Software Engineering student at the University of Waterloo. My interests lie in Natural Language Processing (NLP), Information Retrieval (IR), Agentic Systems, and Robotics. I am fortunate to be advised by Professor Jimmy Lin.

Experiences

AI Engineer

Yupp AI · Aug 2025 - Mar 2026 · San Francisco, CA

I built RAG pipelines, enabled citation extraction across various model providers, and prepared data for annotation.

Publications

Yijun Ge*, Kushaldeep Mujral*, Karthik Nambiar, and Jimmy Lin. 2026. rosaOS: Agentic Operating System for Embodied LLMs. Accepted to ACL 2026 (Demonstration Track).

Yijun Ge, Zibo Guo, Sahel Sharifymoghaddam, and Jimmy Lin. 2026. MCP Servers for Pyserini and RankLLM: Enabling Agentic Retrieval-Augmented Generation. Accepted to SIGIR 2026 (Demonstration Track).

Sahel Sharifymoghaddam*, Yijun Ge*, Raghav Vasudeva, and Jimmy Lin. 2026. Lighting the Way for BRIGHT: Reproducible Baselines with Anserini, Pyserini, and RankLLM. Accepted to SIGIR 2026 (Reproducibility Track). arXiv:2509.02558.

Yijun Ge, Zijian Chen, and Jimmy Lin. 2025. QuackIR: Retrieval in DuckDB and Other Relational Database Management Systems. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 492-500, Suzhou (China). Association for Computational Linguistics.

Projects

Ruth's Super Amazing AI Adventure!

Python · LangChain · Pinecone · SQLite · Groq · Flask · Diffusers

An AI-driven visual novel with LLM dialogue, RAG-backed long-term memory, locally run Stable Diffusion image generation, and custom game logic served through a Flask backend.

AccessAIbility

GeeseHacks 2025 Award Winner · Devpost · Python · MediaPipe · OpenCV

Webcam-based keyboard input from ASL hand gestures and MediaPipe gesture shortcuts for control keys, with modular AI components and real-time OpenCV video.

Misc