Title: AI-Assisted Chat Interface for Clinical Data
Submitter: Sri Sivani Charan Yalamanchi (Biostatistics)
Summary: Researchers currently rely on data analysts to retrieve information from complex clinical datasets, which introduces delays and limits iterative exploration of data. This project proposes a conversational AI interface using retrieval-augmented generation and Text-to-SQL to enable natural language querying, allowing users to directly access and interact with structured datasets in real time.
Benefit: The solution enables self-service access to complex datasets, significantly reducing reliance on intermediaries and shortening query turnaround time. It also enhances research productivity by enabling faster hypothesis testing, deeper data exploration, and more efficient use of institutional data resources.
Tools: RAG, Text-to-SQL, LLMs, Python, PostgreSQL
Test Data: MIMIC III/IV datasets
Any PHI, sensitive information, or otherwise confidential data use: No