Biomaterials text mining: A hands-on comparative study of methods on polydioxanone biocompatibility

ElsevierVolume 77, 25 November 2023, Pages 161-175New BiotechnologyAuthor links open overlay panel, , , , Highlights•

This is the first hands-on application of text mining tools (TMT) in Biomaterials.

We apply multiple TMTs to extract information from the Biomaterials literature.

TMTs produce an informative research map of polydioxanone with main topics & trends.

TMTs also highlight research gaps, missing assets & unresolved obstacles.

We showcase NER’s potential to extract deep data & drive discoveries in Biomaterials.

Abstract

Scientific information extraction is fundamental for research and innovation, but is currently mostly a manual, time-consuming process. Text Mining tools (TMTs) enable automated, accurate and quick information extraction from text, but there is little precedent of their use in the biomaterials field. Here, we compare the ability of various TMTs to extract useful information from biomaterials abstracts. Focusing on the biocompatibility of polydioxanone, a biodegradable polymer for which there are relatively few scientific publications, we tested several tools ranging from machine learning approaches and statistical text analysis to MeSH indexing and domain-specific semantic tools for Named Entity Recognition. We also evaluated their output alongside a manual review of systematic reviews and meta-analyses. The findings show that TMTs can be highly efficient and powerful for mapping biomaterials texts and rapidly yield up-to-date information. Here, TMTs enable one to identify dominating themes, see the evolution of specific terms and topics, and learn about key medical applications in biomaterials literature over the years. The analysis also shows that ambiguity around biomaterials nomenclature is a significant challenge in mining biomedical literature that is yet to be tackled. This research showcases the potential value of using Natural Language Processing and domain-specific tools to extract and organize biomaterials data.

AbbreviationsBCTEO

Bone and Cartilage Tissue Engineering Ontology

CHEBI

Chemical Entities of Biological Interest

CT/MRI

Computed Tomography/Magnetic Resonance Imaging

DEB

Devices, Experimental Scaffolds and Biomaterials Ontology

DEBBIE

Database of Experimental Biomaterials and their Biological Effect

GMDN

Global Medical Device Nomenclature

hLDA

Hierarchical Latent Dirichlet Allocation

MEDLINE

National Library of Medicine

MeSH

Medical Subject Headings

NER

Named Entity Recognition

NLP

Natural Language Processing

PMID

PubMed Unique Identifier

RCT

Randomized Clinical Trial

RN

Registry Number/EC Number

SGD

Stochastic Gradient Descent

SR&MAs

Systematic Reviews and Meta-Analyses

Keywords

Biomaterials

Text mining

Polydioxanone

Biocompatibility

Information extraction

© 2023 The Authors. Published by Elsevier B.V.

留言 (0)

沒有登入
gif