Loading…

Active deep learning for the identification of concepts and relations in electroencephalography reports

[Display omitted] •Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy i...

Full description

Saved in:
Bibliographic Details
Published in:Journal of biomedical informatics 2019-10, Vol.98, p.103265-103265, Article 103265
Main Authors: Maldonado, Ramon, Harabagiu, Sanda M.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3
cites cdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3
container_end_page 103265
container_issue
container_start_page 103265
container_title Journal of biomedical informatics
container_volume 98
creator Maldonado, Ramon
Harabagiu, Sanda M.
description [Display omitted] •Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset. The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports.
doi_str_mv 10.1016/j.jbi.2019.103265
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6922091</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1532046419301844</els_id><sourcerecordid>2283106330</sourcerecordid><originalsourceid>FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</originalsourceid><addsrcrecordid>eNp9UU1LAzEQDaL4_QO8SI5eWvO16S6CIOIXCF70HLLJbJuyTdYkLfjvTa0WvXjKhPfmzcx7CJ1RMqaEysv5eN66MSO0KX_OZLWDDmnF2YiImuxuaykO0FFKc0IorSq5jw44FRNCGnGIpjcmuxVgCzDgHnT0zk9xFyLOM8DOgs-uc0ZnFzwOHTbBGxhywtpbHKH_AhJ2HkMPJscAa3ym-zCNeph9FM4QYk4naK_TfYLT7_cYvd3fvd4-jp5fHp5ub55HRlQ0j6qOCdvUbFJXVVNLagShltlaM9Katms7WQ6HthUTNiGc80ZI2WhTA9eGkablx-h6ozss2wVYU_aPuldDdAsdP1TQTv1FvJupaVgp2bAiQIvAxbdADO9LSFktXDLQ99pDWCbFWM0pkZyTQqUbqokhpQjddgwlah2QmqsSkFoHpDYBlZ7z3_ttO34SKYSrDQGKSysHUSXj1qZaF4vBygb3j_wnA2yjGA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2283106330</pqid></control><display><type>article</type><title>Active deep learning for the identification of concepts and relations in electroencephalography reports</title><source>BACON - Elsevier - GLOBAL_SCIENCEDIRECT-OPENACCESS</source><source>ScienceDirect Journals</source><creator>Maldonado, Ramon ; Harabagiu, Sanda M.</creator><creatorcontrib>Maldonado, Ramon ; Harabagiu, Sanda M.</creatorcontrib><description>[Display omitted] •Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset. The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports.</description><identifier>ISSN: 1532-0464</identifier><identifier>EISSN: 1532-0480</identifier><identifier>DOI: 10.1016/j.jbi.2019.103265</identifier><identifier>PMID: 31470094</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Active learning ; Attribute classification ; Concept detection ; Deep learning ; Electroencephalography ; Long-distance relation identification</subject><ispartof>Journal of biomedical informatics, 2019-10, Vol.98, p.103265-103265, Article 103265</ispartof><rights>2019 Elsevier Inc.</rights><rights>Copyright © 2019 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</citedby><cites>FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,315,786,790,891,27957,27958</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/31470094$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Maldonado, Ramon</creatorcontrib><creatorcontrib>Harabagiu, Sanda M.</creatorcontrib><title>Active deep learning for the identification of concepts and relations in electroencephalography reports</title><title>Journal of biomedical informatics</title><addtitle>J Biomed Inform</addtitle><description>[Display omitted] •Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset. The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports.</description><subject>Active learning</subject><subject>Attribute classification</subject><subject>Concept detection</subject><subject>Deep learning</subject><subject>Electroencephalography</subject><subject>Long-distance relation identification</subject><issn>1532-0464</issn><issn>1532-0480</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9UU1LAzEQDaL4_QO8SI5eWvO16S6CIOIXCF70HLLJbJuyTdYkLfjvTa0WvXjKhPfmzcx7CJ1RMqaEysv5eN66MSO0KX_OZLWDDmnF2YiImuxuaykO0FFKc0IorSq5jw44FRNCGnGIpjcmuxVgCzDgHnT0zk9xFyLOM8DOgs-uc0ZnFzwOHTbBGxhywtpbHKH_AhJ2HkMPJscAa3ym-zCNeph9FM4QYk4naK_TfYLT7_cYvd3fvd4-jp5fHp5ub55HRlQ0j6qOCdvUbFJXVVNLagShltlaM9Katms7WQ6HthUTNiGc80ZI2WhTA9eGkablx-h6ozss2wVYU_aPuldDdAsdP1TQTv1FvJupaVgp2bAiQIvAxbdADO9LSFktXDLQ99pDWCbFWM0pkZyTQqUbqokhpQjddgwlah2QmqsSkFoHpDYBlZ7z3_ttO34SKYSrDQGKSysHUSXj1qZaF4vBygb3j_wnA2yjGA</recordid><startdate>20191001</startdate><enddate>20191001</enddate><creator>Maldonado, Ramon</creator><creator>Harabagiu, Sanda M.</creator><general>Elsevier Inc</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20191001</creationdate><title>Active deep learning for the identification of concepts and relations in electroencephalography reports</title><author>Maldonado, Ramon ; Harabagiu, Sanda M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Active learning</topic><topic>Attribute classification</topic><topic>Concept detection</topic><topic>Deep learning</topic><topic>Electroencephalography</topic><topic>Long-distance relation identification</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Maldonado, Ramon</creatorcontrib><creatorcontrib>Harabagiu, Sanda M.</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of biomedical informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Maldonado, Ramon</au><au>Harabagiu, Sanda M.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Active deep learning for the identification of concepts and relations in electroencephalography reports</atitle><jtitle>Journal of biomedical informatics</jtitle><addtitle>J Biomed Inform</addtitle><date>2019-10-01</date><risdate>2019</risdate><volume>98</volume><spage>103265</spage><epage>103265</epage><pages>103265-103265</pages><artnum>103265</artnum><issn>1532-0464</issn><eissn>1532-0480</eissn><notes>ObjectType-Article-1</notes><notes>SourceType-Scholarly Journals-1</notes><notes>ObjectType-Feature-2</notes><notes>content type line 23</notes><abstract>[Display omitted] •Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset. The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>31470094</pmid><doi>10.1016/j.jbi.2019.103265</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1532-0464
ispartof Journal of biomedical informatics, 2019-10, Vol.98, p.103265-103265, Article 103265
issn 1532-0464
1532-0480
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6922091
source BACON - Elsevier - GLOBAL_SCIENCEDIRECT-OPENACCESS; ScienceDirect Journals
subjects Active learning
Attribute classification
Concept detection
Deep learning
Electroencephalography
Long-distance relation identification
title Active deep learning for the identification of concepts and relations in electroencephalography reports
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-09-22T02%3A27%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Active%20deep%20learning%20for%20the%20identification%20of%20concepts%20and%20relations%20in%20electroencephalography%20reports&rft.jtitle=Journal%20of%20biomedical%20informatics&rft.au=Maldonado,%20Ramon&rft.date=2019-10-01&rft.volume=98&rft.spage=103265&rft.epage=103265&rft.pages=103265-103265&rft.artnum=103265&rft.issn=1532-0464&rft.eissn=1532-0480&rft_id=info:doi/10.1016/j.jbi.2019.103265&rft_dat=%3Cproquest_pubme%3E2283106330%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2283106330&rft_id=info:pmid/31470094&rfr_iscdi=true