Loading…
Active deep learning for the identification of concepts and relations in electroencephalography reports
[Display omitted] •Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy i...
Saved in:
Published in: | Journal of biomedical informatics 2019-10, Vol.98, p.103265-103265, Article 103265 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3 |
---|---|
cites | cdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3 |
container_end_page | 103265 |
container_issue | |
container_start_page | 103265 |
container_title | Journal of biomedical informatics |
container_volume | 98 |
creator | Maldonado, Ramon Harabagiu, Sanda M. |
description | [Display omitted]
•Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset.
The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports. |
doi_str_mv | 10.1016/j.jbi.2019.103265 |
format | article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6922091</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1532046419301844</els_id><sourcerecordid>2283106330</sourcerecordid><originalsourceid>FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</originalsourceid><addsrcrecordid>eNp9UU1LAzEQDaL4_QO8SI5eWvO16S6CIOIXCF70HLLJbJuyTdYkLfjvTa0WvXjKhPfmzcx7CJ1RMqaEysv5eN66MSO0KX_OZLWDDmnF2YiImuxuaykO0FFKc0IorSq5jw44FRNCGnGIpjcmuxVgCzDgHnT0zk9xFyLOM8DOgs-uc0ZnFzwOHTbBGxhywtpbHKH_AhJ2HkMPJscAa3ym-zCNeph9FM4QYk4naK_TfYLT7_cYvd3fvd4-jp5fHp5ub55HRlQ0j6qOCdvUbFJXVVNLagShltlaM9Katms7WQ6HthUTNiGc80ZI2WhTA9eGkablx-h6ozss2wVYU_aPuldDdAsdP1TQTv1FvJupaVgp2bAiQIvAxbdADO9LSFktXDLQ99pDWCbFWM0pkZyTQqUbqokhpQjddgwlah2QmqsSkFoHpDYBlZ7z3_ttO34SKYSrDQGKSysHUSXj1qZaF4vBygb3j_wnA2yjGA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2283106330</pqid></control><display><type>article</type><title>Active deep learning for the identification of concepts and relations in electroencephalography reports</title><source>BACON - Elsevier - GLOBAL_SCIENCEDIRECT-OPENACCESS</source><source>ScienceDirect Journals</source><creator>Maldonado, Ramon ; Harabagiu, Sanda M.</creator><creatorcontrib>Maldonado, Ramon ; Harabagiu, Sanda M.</creatorcontrib><description>[Display omitted]
•Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset.
The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports.</description><identifier>ISSN: 1532-0464</identifier><identifier>EISSN: 1532-0480</identifier><identifier>DOI: 10.1016/j.jbi.2019.103265</identifier><identifier>PMID: 31470094</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Active learning ; Attribute classification ; Concept detection ; Deep learning ; Electroencephalography ; Long-distance relation identification</subject><ispartof>Journal of biomedical informatics, 2019-10, Vol.98, p.103265-103265, Article 103265</ispartof><rights>2019 Elsevier Inc.</rights><rights>Copyright © 2019 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</citedby><cites>FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,315,786,790,891,27957,27958</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/31470094$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Maldonado, Ramon</creatorcontrib><creatorcontrib>Harabagiu, Sanda M.</creatorcontrib><title>Active deep learning for the identification of concepts and relations in electroencephalography reports</title><title>Journal of biomedical informatics</title><addtitle>J Biomed Inform</addtitle><description>[Display omitted]
•Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset.
The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports.</description><subject>Active learning</subject><subject>Attribute classification</subject><subject>Concept detection</subject><subject>Deep learning</subject><subject>Electroencephalography</subject><subject>Long-distance relation identification</subject><issn>1532-0464</issn><issn>1532-0480</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9UU1LAzEQDaL4_QO8SI5eWvO16S6CIOIXCF70HLLJbJuyTdYkLfjvTa0WvXjKhPfmzcx7CJ1RMqaEysv5eN66MSO0KX_OZLWDDmnF2YiImuxuaykO0FFKc0IorSq5jw44FRNCGnGIpjcmuxVgCzDgHnT0zk9xFyLOM8DOgs-uc0ZnFzwOHTbBGxhywtpbHKH_AhJ2HkMPJscAa3ym-zCNeph9FM4QYk4naK_TfYLT7_cYvd3fvd4-jp5fHp5ub55HRlQ0j6qOCdvUbFJXVVNLagShltlaM9Katms7WQ6HthUTNiGc80ZI2WhTA9eGkablx-h6ozss2wVYU_aPuldDdAsdP1TQTv1FvJupaVgp2bAiQIvAxbdADO9LSFktXDLQ99pDWCbFWM0pkZyTQqUbqokhpQjddgwlah2QmqsSkFoHpDYBlZ7z3_ttO34SKYSrDQGKSysHUSXj1qZaF4vBygb3j_wnA2yjGA</recordid><startdate>20191001</startdate><enddate>20191001</enddate><creator>Maldonado, Ramon</creator><creator>Harabagiu, Sanda M.</creator><general>Elsevier Inc</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20191001</creationdate><title>Active deep learning for the identification of concepts and relations in electroencephalography reports</title><author>Maldonado, Ramon ; Harabagiu, Sanda M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Active learning</topic><topic>Attribute classification</topic><topic>Concept detection</topic><topic>Deep learning</topic><topic>Electroencephalography</topic><topic>Long-distance relation identification</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Maldonado, Ramon</creatorcontrib><creatorcontrib>Harabagiu, Sanda M.</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of biomedical informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Maldonado, Ramon</au><au>Harabagiu, Sanda M.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Active deep learning for the identification of concepts and relations in electroencephalography reports</atitle><jtitle>Journal of biomedical informatics</jtitle><addtitle>J Biomed Inform</addtitle><date>2019-10-01</date><risdate>2019</risdate><volume>98</volume><spage>103265</spage><epage>103265</epage><pages>103265-103265</pages><artnum>103265</artnum><issn>1532-0464</issn><eissn>1532-0480</eissn><notes>ObjectType-Article-1</notes><notes>SourceType-Scholarly Journals-1</notes><notes>ObjectType-Feature-2</notes><notes>content type line 23</notes><abstract>[Display omitted]
•Transformer encoders enable end-to-end, accurate knowledge extraction from EEG reports.•Medical concepts, their attributes and relations between them can be extracted jointly.•Joint learning enables active learning policy that selects based on all 3 tasks.•Active learning policy itself can be learned with imitation learning and a seed dataset.
The identification of medical concepts, their attributes and the relations between concepts in a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. However, the recognition of multiple types of medical concepts, along with the many attributes characterizing them is challenging, and so is the recognition of the possible relations between them, especially when desiring to make use of active learning. To address these challenges, in this paper we present the Self-Attention Concept, Attribute and Relation (SACAR) identifier, which relies on a powerful encoding mechanism based on the recently introduced Transformer neural architecture (Dehghani et al., 2018). The SACAR identifier enabled us to consider a recently introduced framework for active learning which uses deep imitation learning for its selection policy. Our experimental results show that SACAR was able to identify medical concepts more precisely and exhibited enhanced recall, compared with previous methods. Moreover, SACAR achieves superior performance in attribute classification for attribute categories of interest, while identifying the relations between concepts with performance competitive with our previous techniques. As a multi-task network, SACAR achieves this performance on the three prediction tasks simultaneously, with a single, complex neural network. The learning curves obtained in the active learning process when using the novel Active Learning Policy Neural Network (ALPNN) show a significant increase in performance as the active learning progresses. These promising results enable the extraction of clinical knowledge available in a large collection of EEG reports.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>31470094</pmid><doi>10.1016/j.jbi.2019.103265</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1532-0464 |
ispartof | Journal of biomedical informatics, 2019-10, Vol.98, p.103265-103265, Article 103265 |
issn | 1532-0464 1532-0480 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6922091 |
source | BACON - Elsevier - GLOBAL_SCIENCEDIRECT-OPENACCESS; ScienceDirect Journals |
subjects | Active learning Attribute classification Concept detection Deep learning Electroencephalography Long-distance relation identification |
title | Active deep learning for the identification of concepts and relations in electroencephalography reports |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-09-22T02%3A27%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Active%20deep%20learning%20for%20the%20identification%20of%20concepts%20and%20relations%20in%20electroencephalography%20reports&rft.jtitle=Journal%20of%20biomedical%20informatics&rft.au=Maldonado,%20Ramon&rft.date=2019-10-01&rft.volume=98&rft.spage=103265&rft.epage=103265&rft.pages=103265-103265&rft.artnum=103265&rft.issn=1532-0464&rft.eissn=1532-0480&rft_id=info:doi/10.1016/j.jbi.2019.103265&rft_dat=%3Cproquest_pubme%3E2283106330%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c451t-5f24d98278559861c401d2d8a20bcbfbf6101ebb4727033394669ac8e3ac209b3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2283106330&rft_id=info:pmid/31470094&rfr_iscdi=true |