Skip to nav Skip to content
Thanh  Thieu

Thanh Thieu, PhD

Program: Machine Learning

Research Program: Health Outcomes & Behavior Program

View Lab Page

Contact

  • Overview

    Dr. Thieu’s research interests center on using natural language processing (NLP) to process free text clinical notes in electronic health records and free text scientific reports in (bio)medical literature. His research spans whole-person functional status information, knowledge graph extraction, high throughput text mining, lexical complexity and language generation, and computer-assisted coding for healthcare and medical billing.

    Associations

    • Machine Learning
    • Health Outcomes & Behavior Program

    Education & Training

    Graduate:

    • University of Missouri, PhD - Computer Science

    Fellowship:

    • National Institute of Health Clinical Center - Health Informatics
  • Research Interest

    Dr. Thieu’s research interests center on using natural language processing (NLP) to process free text clinical notes in electronic health records and free text scientific reports in (bio)medical literature. His research spans whole-person functional status information, knowledge graph extraction, high throughput text mining, lexical complexity and language generation, and computer-assisted coding for healthcare and medical billing.

  • Publications

    • Amorrortu R, Garcia M, Zhao Y, El Naqa I, Balagurunathan Y, Chen DT, Thieu T, Schabath MB, Rollison DE. Overview of approaches to estimate real-world disease progression in lung cancer. JNCI Cancer Spectr. 2023 Oct.7(6). Pubmedid: 37738580. Pmcid: PMC10637832.
    • Le TD, Nguyen PD, Korkin D, Thieu T. PHILM2Web: A high-throughput database of macromolecular host-pathogen interactions on the Web. Database (Oxford). 2022 Jun.2022. Pubmedid: 35776535. Pmcid: PMC9248916.
    • Thieu T, Maldonado JC, Ho PS, Ding M, Marr A, Brandt D, Newman-Griffis D, Zirikly A, Chan L, Rasch E. A comprehensive study of mobility functioning information in clinical notes: Entity hierarchy, corpus annotation, and sequence labeling. Int J Med Inform. 2021 Mar.147:104351. Pubmedid: 33401169. Pmcid: PMC8104034.
    • Newman-Griffis D, Porcino J, Zirikly A, Thieu T, Camacho Maldonado J, Ho PS, Ding M, Chan L, Rasch E. Broadening horizons: the case for capturing function and the role of health informatics in its use. BMC Public Health. 2019 Oct.19(1):1288. Pubmedid: 31615472. Pmcid: PMC6794808.
    • Thieu T, Joshi S, Warren S, Korkin D. Literature mining of host-pathogen interactions: comparing feature-based supervised learning and language-based approaches. Bioinformatics. 2012 Mar.28(6):867-875. Pubmedid: 22285561.
  • Grants

    • Title: Applying Large Language Models to Accelerate Abstraction of Cancer Pathology Reports for Cancer Registry (LLMs for Unstructured Data Extraction)
      Sponsor: Nat Institutes of Health
      PI: Cleveland, J., Project PI: Thieu, T.

Find a Researcher Search