The bionic DBMS is coming, but what will it look like?
Ryan Johnson, Ippokratis Pandis
CIDR 2013
Clinical databases are essential for clinical and translational research. Traditionally, curating a clinical database involves manually collecting data from free-text notes within the electronic medical record (EMR), but this process is time-consuming and error-prone. Recently, Large Language Models (LLMs) such as OpenAI's ChatGPT and Google's Gemini have demonstrated impressive semantic understanding of free text, and could be used to automate the free-text data extraction tasks that once required human experts and trainees. Unfortunately, these free-text notes often contain protected health information and are themselves a valuable asset, so health systems restrict their transfer to third-party AI providers such as those mentioned above. The goal of this study is to evaluate the feasibility of avoiding data transfer by using an open-source AI model to generate a clinical database of kidney cancer patients from free-text radiology, pathology, and operative notes.