Natural Language Processing (NLP) applications have the potential to aid in situations where humans can benefit from augmented decision-making. This is particularly important in the healthcare domain as an overabundance of information within electronic health records (EHRs) has resulted in a need for automated systems to mitigate the cognitive burden on physicians utilizing today’s EHR systems. Since the early 2000s, community challenges, or shared tasks, have pushed development of NLP systems and tested the state-of-the-art for applications of these technologies in a variety of increasingly more complex challenge tasks in the clinical domain. In addition to use cases in the clinical setting, secondary uses of EHR data, such as clinical trial cohort selection, insurance underwriting, and adverse drug event reporting are also in need of such information extraction and text summarization applications.
Significant gaps remain between existing NLP shared tasks and systems integrated into the workflow of real clinical applications and real-world use cases\(^{1}\). To help bridge these gaps, we propose a new shared task to extract evidence of clinical factors from clinical notes for a given medical condition, where a clinical factor is defined as information on the screening, diagnosis, or management of the condition according to its relevant clinical practice guidelines. The main differentiators of this task are,
The conditions were chosen based on a published analysis of the top admission numbers by ICD-9 diagnosis within the MIMC-III database published by Huang et al. Our selection of conditions was also driven by an analysis of 2019 Market Scan claims data to identify high-cost medical conditions and episodes of care. Data for 3 conditions will be released in the training phase, and 2 additional conditions will be reserved as testing data.
Clinical notes written by residents and attending physicians were selected from MIMIC (Medical Information Mart for Intensive Care), a single-center dataset comprised of de-identified data relating to patients admitted to critical care units at a large tertiary care hospital. Data associated with approximately 60,000 critical care admissions includes structured information such as vital signs, medications, laboratory measurements, and unstructured notes charted by care providers. The database has been extensively used to support applications ranging from academic research, quality improvement initiatives, and predictive modeling. The annotations are created by 6 physicians using 1,500 admission notes and discharge summaries selected from MIMIC-III.
A candidate list of clinical factors for each clinical condition was compiled by a review of clinical guidelines authored by a medical society or association (e.g., American College of Cardiology) and a point of care clinical knowledge resource (e.g., UpToDate by Wolters Kluwer). Clinical factors were considered across the spectrum of evaluation and management, and could encompass specific symptoms, narrative histories, or physical exam findings. This list of candidate clinical factors was refined by eliminating any clinical factor represented by a structured data field within an EMR (e.g., demographic information, problems lists, medication orders, laboratory results). The final list of clinical factors defined are represented as unstructured text or spans of text within the clinical notes. Two of the conditions and a subset of clinical factors identified for this challenge task are represented in the following table
Clinical Condition | Clinical Factors |
---|---|
Hypertension | Tobacco use Excess alcohol intake Non-adherence |
Diabetes Type II | First degree relative with DM Hyperglycemic symptoms Tobacco use |
Training set: The registered participants of the task will be provided with gold standard annotations created by medical experts on over 900 MIMIC-III clinical notes along with their corresponding ROW_IDs in noteevents table. Of these, 600 clinical notes will be used as training set , 300 clinical notes will be used as validation set.
Test set: The ROW_IDs of 600 MIMIC-III clinical notes will be released to the registered participations during the evaluation phase. The registered participants are expected to provide their system predictions on these notes in CONLL-2002 format (similar to training set annotations format).
To be added.
Call for participation, with information about the training data to be annouced at AMIA Natural Language Working Group Pre-symposium. Watch this space for more details in the coming months.
The IBM Challenge Team consists of a multidisciplinary group of researchers, scientists, clinicians and developers from IBM Research and IBM Watson Health. We have extensive training in NLP development, implementation, and evaluation and have published widely on topics related to state of the art including evaluation and annotation, extraction of clinical concepts and relations, identification of contextual features, and textual summarization. Our team has participated and placed highly in many clinical NLP challenges since 2017 (TAC 2017-19, MADE 1.0, N2C2 2018-19, mediQA 2021), and has been curating and sharing annotated datasets EMRQA, WNTRAC with the research community.
Parthasarathy Suryanarayanan is a Senior Technical Staff Member and Chief Architect in the Healthcare & Life Sciences department at IBM Research. He has about 17 years of industry experience building scalable information processing architectures that enable both collaborative research and rapid commercialization. As the engineering lead in one of IBM’s research teams, he has spent the last decade in applying Natural Language Processing to the medical domain, building various research prototypes and helping them turn into business offerings. His interests include natural language processing, machine learning, algorithm design, information retrieval and software engineering.
Mario J. Lorenzo currently serves as Chief Architect for Healthcare NLP in IBM Watson Health. He has 16 years of industry experience developing systems and software for IBM. During the last ten years, he has focused on applying AI and NLP to complex process transformation problems across Healthcare, Life Sciences, and Financial Services industries. Mario pioneered the first commercialization of IBM Watson in healthcare and holds numerous patents, publications, and technical awards in this field. Mario earned his Ph.D. in Computer Science with a focus on AI-based NLP. and a M.S. in Computer Science with a focus on Distributed computing.
Jennifer J. Liang is a Medical Researcher in the Healthcare and Life Sciences department at IBM Research. As the resident subject matter expert, she works closely with AI researchers to develop analytics for use in the medical domain based on natural language processing and machine learning. Dr. Liang has been with IBM for more than 7 years, supporting both Research and Watson Health in creating annotated datasets, developing and evaluating new analytics, and adapting existing technologies to new client data. Her current focus is on applications for use in the clinical setting, including analytics on electronic medical records and clinical decision support systems. She holds an MD from New York Medical College and a BS in Materials Science and Engineering from Massachusetts Institute of Technology.
Ching-Huei Tsou is a research staff member and manager in the Healthcare & Life Sciences department at IBM Research. His areas of expertise include machine learning, natural language processing, numerical optimization, and software engineering. Dr. Tsou joined IBM's Jeopardy-wining research team in 2012 and contributed in the initiative in shifting the research emphasis from general question-answering to exploring the use of natural language processing to benefit healthcare. His current research interest is in building health informatics to support clinical decision making, specifically, in the area of extracting clinical insights and generating summarizations from unstructured patient records.
Brett R South is a Biomedical Informatician working with the Center for AI, Research, and Evaluation (CARE) within IBM Watson Health. His areas of expertise include Natural Language Processing (NLP), data science, clinical informatics, and evaluation methods. Brett has led a diverse array of NLP studies for various information extraction and classification tasks. Brett has a strong love and passion for implementation of technology applied to healthcare particularly as it relates to the use of AI and its potential impact on patient outcomes and clinical decision making. He is a member of AMIA and serves on the editorial board for the journal Applied Clinical Informatics.
Bharath Dandala is a research staff member in the Healthcare & Life Sciences department at IBM Research. In this role, Dr Dandala researches and develops computational methods in the field of Computational Healthcare. Dr. Dandala has been with IBM for more than seven years, supporting multiple research projects, establishing innovative technologies and methods that enable state-of-the-art natural language processing systems in both general and clinical domain. His area of specialty include machine learning, natural language Processing, and software engineering. Dr. Dandala holds a Ph.D. and master’s in computer science from University of North Texas, and a Bachelor of Technology from Jawaharlal Nehru Technological University.
Alice Landis-McGrath is a physician with over 19 years of informatics experience, is an Associate Chief Health Officer at IBM Watson Health, where she focuses on the use of data, analytic tools, and clinical decision support technologies to transform clinical and operational processes. She provides global clinical subject matter expertise for the Life Sciences Real World Data and Provider Oncology solutions portfolios. In this role, she provides sales and strategy support for life sciences and provider organizations looking to leverage data and cognitive technologies to transform the product life cycle, clinical research and clinical delivery. Dr. Landis-McGrath earned her MD from the University of Pennsylvania School of Medicine, and her BA in Chemistry from Franklin & Marshall College.
Wendy W Chapman, Prakash M Nadkarni, Lynette Hirschman, Leonard W D'Avolio, Guergana K Savova, Ozlem Uzuner, Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions, Journal of the American Medical Informatics Association, Volume 18, Issue 5, September 2011, Pages 540–543, https://doi.org/10.1136/amiajnl-2011-000465
Özlem Uzuner, Imre Solti, Eithon Cadag, Extracting medication information from clinical text, Journal of the American Medical Informatics Association, Volume 17, Issue 5, September 2010, Pages 514–518, https://doi.org/10.1136/jamia.2010.003947
Özlem Uzuner, Brett R South, Shuying Shen, Scott L DuVall, 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text, Journal of the American Medical Informatics Association, Volume 18, Issue 5, September 2011, Pages 552–556, https://doi.org/10.1136/amiajnl-2011-000203
Ozlem Uzuner, Andreea Bodnari, Shuying Shen, Tyler Forbush, John Pestian, Brett R South, Evaluating the state of the art in coreference resolution for electronic medical records, Journal of the American Medical Informatics Association, Volume 19, Issue 5, September 2012, Pages 786–791, https://doi.org/10.1136/amiajnl-2011-000784
Weiyi Sun, Anna Rumshisky, Ozlem Uzuner, Evaluating temporal relations in clinical text: 2012 i2b2 Challenge, Journal of the American Medical Informatics Association, Volume 20, Issue 5, September 2013, Pages 806–813, https://doi.org/10.1136/amiajnl-2013-001628
Weiyi Sun, Anna Rumshisky, Ozlem Uzuner, Annotating temporal information in clinical narratives, Journal of Biomedical Informatics, Volume 46, Supplement,2013, Pages S5-S12, ISSN 1532-0464, https://doi.org/10.1016/j.jbi.2013.07.004
Vishesh Kumar, Amber Stubbs, Stanley Shaw, Özlem Uzuner, Creation of a new longitudinal corpus of clinical narratives, Journal of Biomedical Informatics, Volume 58, Supplement, 2015, Pages S6-S10, ISSN 1532-0464, https://doi.org/10.1016/j.jbi.2015.09.018
Sam Henry, Kevin Buchan, Michele Filannino, Amber Stubbs, Ozlem Uzuner, 2018 n2c2 shared task on adverse drug events and medication extraction in electronic health records, Journal of the American Medical Informatics Association, Volume 27, Issue 1, January 2020, Pages 3–12, https://doi.org/10.1093/jamia/ocz166