Thanh Thieu, PhD
Thanh Thieu, PhD
Program: Machine Learning
Research Program: Health Outcomes & Behavior Program
-
Overview
Dr. Thieu’s research interests center on using natural language processing (NLP) to process free text clinical notes in electronic health records and free text scientific reports in (bio)medical literature. His research spans whole-person functional status information, knowledge graph extraction, high throughput text mining, lexical complexity and language generation, and computer-assisted coding for healthcare and medical billing.
Associations
- Machine Learning
- Health Outcomes & Behavior Program
Education & Training
Graduate:
- University of Missouri, PhD - Computer Science
Fellowship:
- National Institute of Health Clinical Center - Health Informatics
-
Research Interest
Dr. Thieu’s research interests center on using natural language processing (NLP) to process free text clinical notes in electronic health records and free text scientific reports in (bio)medical literature. His research spans whole-person functional status information, knowledge graph extraction, high throughput text mining, lexical complexity and language generation, and computer-assisted coding for healthcare and medical billing.
-
Publications
- Amorrortu R, Garcia M, Zhao Y, El Naqa I, Balagurunathan Y, Chen DT, Thieu T, Schabath MB, Rollison DE. Overview of approaches to estimate real-world disease progression in lung cancer. JNCI Cancer Spectr. 2023 Oct.7(6). Pubmedid: 37738580. Pmcid: PMC10637832.
- Le TD, Nguyen PD, Korkin D, Thieu T. PHILM2Web: A high-throughput database of macromolecular host-pathogen interactions on the Web. Database (Oxford). 2022 Jun.2022. Pubmedid: 35776535. Pmcid: PMC9248916.
- Thieu T, Maldonado JC, Ho PS, Ding M, Marr A, Brandt D, Newman-Griffis D, Zirikly A, Chan L, Rasch E. A comprehensive study of mobility functioning information in clinical notes: Entity hierarchy, corpus annotation, and sequence labeling. Int J Med Inform. 2021 Mar.147:104351. Pubmedid: 33401169. Pmcid: PMC8104034.
- Newman-Griffis D, Porcino J, Zirikly A, Thieu T, Camacho Maldonado J, Ho PS, Ding M, Chan L, Rasch E. Broadening horizons: the case for capturing function and the role of health informatics in its use. BMC Public Health. 2019 Oct.19(1):1288. Pubmedid: 31615472. Pmcid: PMC6794808.
- Thieu T, Joshi S, Warren S, Korkin D. Literature mining of host-pathogen interactions: comparing feature-based supervised learning and language-based approaches. Bioinformatics. 2012 Mar.28(6):867-875. Pubmedid: 22285561.
-
Grants
- Title: Applying Large Language Models to Accelerate Abstraction of Cancer Pathology Reports for Cancer Registry (LLMs for Unstructured Data Extraction)
Sponsor: Nat Institutes of Health
PI: Cleveland, J., Project PI: Thieu, T.
- Title: Applying Large Language Models to Accelerate Abstraction of Cancer Pathology Reports for Cancer Registry (LLMs for Unstructured Data Extraction)