Loading…

Active deep learning for the identification of concepts and relations in electroencephalography reports

[Display omitted] •Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy i...

Full description

Saved in:

Bibliographic Details
Published in:	Journal of biomedical informatics 2019-10, Vol.98, p.103265-103265, Article 103265
Main Authors:	Maldonado, Ramon, Harabagiu, Sanda M.
Format:	Article
Language:	English
Subjects:	Active learning Attribute classification Concept detection Deep learning Electroencephalography Long-distance relation identification
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3
cites	cdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3
container_end_page	103265
container_issue
container_start_page	103265
container_title	Journal of biomedical informatics
container_volume	98
creator	Maldonado, Ramon Harabagiu, Sanda M.
description	[Display omitted] •Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset. The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports.
doi_str_mv	10.1016/j.jbi.2019.103265
format	article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6922091</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1532046419301844</els_id><sourcerecordid>2283106330</sourcerecordid><originalsourceid>FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</originalsourceid><addsrcrecordid>eNp9UU1LAzEQDaL4_QO8SI5eWvO16S6CIOIXCF70HLLJbJuyTdYkLfjvTa0WvXjKhPfmzcx7CJ1RMqaEysv5eN66MSO0KX_OZLWDDmnF2YiImuxuaykO0FFKc0IorSq5jw44FRNCGnGIpjcmuxVgCzDgHnT0zk9xFyLOM8DOgs-uc0ZnFzwOHTbBGxhywtpbHKH_AhJ2HkMPJscAa3ym-zCNeph9FM4QYk4naK_TfYLT7_cYvd3fvd4-jp5fHp5ub55HRlQ0j6qOCdvUbFJXVVNLagShltlaM9Katms7WQ6HthUTNiGc80ZI2WhTA9eGkablx-h6ozss2wVYU_aPuldDdAsdP1TQTv1FvJupaVgp2bAiQIvAxbdADO9LSFktXDLQ99pDWCbFWM0pkZyTQqUbqokhpQjddgwlah2QmqsSkFoHpDYBlZ7z3_ttO34SKYSrDQGKSysHUSXj1qZaF4vBygb3j_wnA2yjGA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2283106330</pqid></control><display><type>article</type><title>Active deep learning for the identification of concepts and relations in electroencephalography reports</title><source>BACON - Elsevier - GLOBAL_SCIENCEDIRECT-OPENACCESS</source><source>ScienceDirect Journals</source><creator>Maldonado, Ramon ; Harabagiu, Sanda M.</creator><creatorcontrib>Maldonado, Ramon ; Harabagiu, Sanda M.</creatorcontrib><description>[Display omitted] •Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset. The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports.</description><identifier>ISSN: 1532-0464</identifier><identifier>EISSN: 1532-0480</identifier><identifier>DOI: 10.1016/j.jbi.2019.103265</identifier><identifier>PMID: 31470094</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Active learning ; Attribute classification ; Concept detection ; Deep learning ; Electroencephalography ; Long-distance relation identification</subject><ispartof>Journal of biomedical informatics, 2019-10, Vol.98, p.103265-103265, Article 103265</ispartof><rights>2019 Elsevier Inc.</rights><rights>Copyright © 2019 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</citedby><cites>FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,315,786,790,891,27957,27958</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/31470094$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Maldonado, Ramon</creatorcontrib><creatorcontrib>Harabagiu, Sanda M.</creatorcontrib><title>Active deep learning for the identification of concepts and relations in electroencephalography reports</title><title>Journal of biomedical informatics</title><addtitle>J Biomed Inform</addtitle><description>[Display omitted] •Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset. The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports.</description><subject>Active learning</subject><subject>Attribute classification</subject><subject>Concept detection</subject><subject>Deep learning</subject><subject>Electroencephalography</subject><subject>Long-distance relation identification</subject><issn>1532-0464</issn><issn>1532-0480</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9UU1LAzEQDaL4_QO8SI5eWvO16S6CIOIXCF70HLLJbJuyTdYkLfjvTa0WvXjKhPfmzcx7CJ1RMqaEysv5eN66MSO0KX_OZLWDDmnF2YiImuxuaykO0FFKc0IorSq5jw44FRNCGnGIpjcmuxVgCzDgHnT0zk9xFyLOM8DOgs-uc0ZnFzwOHTbBGxhywtpbHKH_AhJ2HkMPJscAa3ym-zCNeph9FM4QYk4naK_TfYLT7_cYvd3fvd4-jp5fHp5ub55HRlQ0j6qOCdvUbFJXVVNLagShltlaM9Katms7WQ6HthUTNiGc80ZI2WhTA9eGkablx-h6ozss2wVYU_aPuldDdAsdP1TQTv1FvJupaVgp2bAiQIvAxbdADO9LSFktXDLQ99pDWCbFWM0pkZyTQqUbqokhpQjddgwlah2QmqsSkFoHpDYBlZ7z3_ttO34SKYSrDQGKSysHUSXj1qZaF4vBygb3j_wnA2yjGA</recordid><startdate>20191001</startdate><enddate>20191001</enddate><creator>Maldonado, Ramon</creator><creator>Harabagiu, Sanda M.</creator><general>Elsevier Inc</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20191001</creationdate><title>Active deep learning for the identification of concepts and relations in electroencephalography reports</title><author>Maldonado, Ramon ; Harabagiu, Sanda M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Active learning</topic><topic>Attribute classification</topic><topic>Concept detection</topic><topic>Deep learning</topic><topic>Electroencephalography</topic><topic>Long-distance relation identification</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Maldonado, Ramon</creatorcontrib><creatorcontrib>Harabagiu, Sanda M.</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of biomedical informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Maldonado, Ramon</au><au>Harabagiu, Sanda M.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Active deep learning for the identification of concepts and relations in electroencephalography reports</atitle><jtitle>Journal of biomedical informatics</jtitle><addtitle>J Biomed Inform</addtitle><date>2019-10-01</date><risdate>2019</risdate><volume>98</volume><spage>103265</spage><epage>103265</epage><pages>103265-103265</pages><artnum>103265</artnum><issn>1532-0464</issn><eissn>1532-0480</eissn><notes>ObjectType-Article-1</notes><notes>SourceType-Scholarly Journals-1</notes><notes>ObjectType-Feature-2</notes><notes>content type line 23</notes><abstract>[Display omitted] •Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset. The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>31470094</pmid><doi>10.1016/j.jbi.2019.103265</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1532-0464
ispartof	Journal of biomedical informatics, 2019-10, Vol.98, p.103265-103265, Article 103265
issn	1532-0464 1532-0480
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6922091
source	BACON - Elsevier - GLOBAL_SCIENCEDIRECT-OPENACCESS; ScienceDirect Journals
subjects	Active learning Attribute classification Concept detection Deep learning Electroencephalography Long-distance relation identification
title	Active deep learning for the identification of concepts and relations in electroencephalography reports
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-09-22T02%3A27%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Active%20deep%20learning%20for%20the%20identification%20of%20concepts%20and%20relations%20in%20electroencephalography%20reports&rft.jtitle=Journal%20of%20biomedical%20informatics&rft.au=Maldonado,%20Ramon&rft.date=2019-10-01&rft.volume=98&rft.spage=103265&rft.epage=103265&rft.pages=103265-103265&rft.artnum=103265&rft.issn=1532-0464&rft.eissn=1532-0480&rft_id=info:doi/10.1016/j.jbi.2019.103265&rft_dat=%3Cproquest_pubme%3E2283106330%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2283106330&rft_id=info:pmid/31470094&rfr_iscdi=true