Trends in COVID-19 Publications: Streamlining Research Using NLP and LDA

Gupta, Akash and Aeron, Shrey and Agrawal, Anjali and Gupta, Himanshu (2021) Trends in COVID-19 Publications: Streamlining Research Using NLP and LDA. Frontiers in Digital Health, 3. ISSN 2673-253X

[thumbnail of pubmed-zip/versions/1/package-entries/fdgth-03-686720/fdgth-03-686720.pdf] Text
pubmed-zip/versions/1/package-entries/fdgth-03-686720/fdgth-03-686720.pdf - Published Version

Download (2MB)

Abstract

Background: Research publications related to the novel coronavirus disease COVID-19 are rapidly increasing. However, current online literature hubs, even with artificial intelligence, are limited in identifying the complexity of COVID-19 research topics. We developed a comprehensive Latent Dirichlet Allocation (LDA) model with 25 topics using natural language processing (NLP) techniques on PubMed® research articles about “COVID.” We propose a novel methodology to develop and visualise temporal trends, and improve existing online literature hubs.

Our results for temporal evolution demonstrate interesting trends, for example, the prominence of “Mental Health” and “Socioeconomic Impact” increased, “Genome Sequence” decreased, and “Epidemiology” remained relatively constant. Applying our methodology to LitCovid, a literature hub from the National Center for Biotechnology Information, we improved the breadth and depth of research topics by subdividing their pre-existing categories. Our topic model demonstrates that research on “masks” and “Personal Protective Equipment (PPE)” is skewed toward clinical applications with a lack of population-based epidemiological research.

Item Type: Article
Subjects: European Repository > Multidisciplinary
Depositing User: Managing Editor
Date Deposited: 10 Dec 2022 12:22
Last Modified: 08 May 2024 03:32
URI: http://go7publish.com/id/eprint/580

Actions (login required)

View Item
View Item