EMS-BERT: A Pre-Trained Language Representation Model for the Emergency Medical Services (EMS) Domain

M. Arif Rahman, Sarah Masud Preum, Ronald D. Williams, Homa Alemzadeh, John Stankovic

Research output: Contribution to book or proceedingConference articlepeer-review

Abstract

Emergency Medical Services (EMS) is an important domain of healthcare. First responders save millions of lives per year. Machine learning and sensing technologies are actively being developed to support first responders in their EMS activities. However, there are significant challenges to overcome in developing these new solutions. One of the main challenges is the limitations of existing methods for EMS text mining, and developing a highly accurate language model for the EMS domain. Several important Bidirectional Encoder Representations from Transformer (BERT) models for medical domains, i.e., BioBERT and ClinicalBERT, have significantly influenced biomedical text mining tasks. But extracting information from the EMS domain is a separate challenge due to the uniqueness of the EMS domain, and the significant scarcity of a high-quality EMS corpus. In this research, we propose EMS-BERT - a BERT model specifically developed for EMS text-mining tasks. For data augmentation on our small, classified EMS corpus which consists of nearly 2.4M words, we use a simultaneous pre-training method for transfer-learning relevant information from medical, bio-medical, and clinical domains; and train a high-performance BERT model. Our thorough evaluation shows at least 2% to as much as 11% improvement of F-1 scores for EMS-BERT on different classification tasks, i.e., entity recognition, relation extraction, and inferring missing information when compared both with existing state-of-the-art clinical entity recognition tools, and with various medical BERT models.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE/ACM International Conference on Connected Health
Subtitle of host publicationApplications, Systems and Engineering Technologies, CHASE 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages34-43
Number of pages10
ISBN (Electronic)9798400701023
DOIs
StatePublished - 2023
Externally publishedYes
Event8th IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies, CHASE 2023 - Orlando, United States
Duration: Jun 21 2023Jun 23 2023

Publication series

NameProceedings - 2023 IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies, CHASE 2023

Conference

Conference8th IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies, CHASE 2023
Country/TerritoryUnited States
CityOrlando
Period06/21/2306/23/23

Scopus Subject Areas

  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Signal Processing
  • Biomedical Engineering
  • Health Informatics
  • Artificial Intelligence

Keywords

  • Emergency Medical Services (EMS) Data Processing and Analysis
  • EMS Entity Recognition
  • Language Model
  • Medicine and Health

Fingerprint

Dive into the research topics of 'EMS-BERT: A Pre-Trained Language Representation Model for the Emergency Medical Services (EMS) Domain'. Together they form a unique fingerprint.

Cite this