Loading…
Deep neural networks for audio scene recognition
These last years, artificial neural networks (ANN) have known a renewed interest since efficient training procedures have emerged to learn the so called deep neural networks (DNN), i.e. ANN with at least two hidden layers. In the same time, the computational auditory scene recognition (CASR) problem...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Citations: | Items that cite this one |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c306t-b53c020d3cefe46c890fc302dc4c7ec806b052a0b12ae7ba67b2a8c9d4c1b6373 |
---|---|
cites | |
container_end_page | 129 |
container_issue | |
container_start_page | 125 |
container_title | |
container_volume | |
creator | Petetin, Yohan Laroche, Cyrille Mayoue, Aurelien |
description | These last years, artificial neural networks (ANN) have known a renewed interest since efficient training procedures have emerged to learn the so called deep neural networks (DNN), i.e. ANN with at least two hidden layers. In the same time, the computational auditory scene recognition (CASR) problem which consists in estimating the environment around a device from the received audio signal has been investigated. Most of works which deal with the CASR problem have tried to ind well-adapted features for this problem. However, these features are generally combined with a classical classi-ier. In this paper, we introduce DNN in the CASR ield and we show that such networks can provide promising results and perform better than standard classiiers when the same features are used. |
doi_str_mv | 10.1109/EUSIPCO.2015.7362358 |
format | conference_proceeding |
fullrecord | <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_7362358</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7362358</ieee_id><sourcerecordid>7362358</sourcerecordid><originalsourceid>FETCH-LOGICAL-c306t-b53c020d3cefe46c890fc302dc4c7ec806b052a0b12ae7ba67b2a8c9d4c1b6373</originalsourceid><addsrcrecordid>eNotj9FKwzAUQKMgOGa_QB_6A503uc1N8ih1usFggu55JOmtBGc70g7x7x24p_Nw4MAR4kHCQkpwj8vd-_qt2S4USL0wSAq1vRKFMxacU5YUIV6LmQJDlaxJ34piHFMAhTVKIj0T8Mx8LHs-ZX84Y_oZ8tdYdkMu_alNQzlG7rnMHIfPPk1p6O_ETecPIxcXzsXuZfnRrKrN9nXdPG2qiEBTFTRGUNBi5I5ritZBdzaqjXU0HC1QAK08BKk8m-DJBOVtdG0dZSA0OBf3_93EzPtjTt8-_-4vj_gHjDBGDw</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Deep neural networks for audio scene recognition</title><source>IEEE Xplore All Conference Series</source><creator>Petetin, Yohan ; Laroche, Cyrille ; Mayoue, Aurelien</creator><creatorcontrib>Petetin, Yohan ; Laroche, Cyrille ; Mayoue, Aurelien</creatorcontrib><description>These last years, artificial neural networks (ANN) have known a renewed interest since efficient training procedures have emerged to learn the so called deep neural networks (DNN), i.e. ANN with at least two hidden layers. In the same time, the computational auditory scene recognition (CASR) problem which consists in estimating the environment around a device from the received audio signal has been investigated. Most of works which deal with the CASR problem have tried to ind well-adapted features for this problem. However, these features are generally combined with a classical classi-ier. In this paper, we introduce DNN in the CASR ield and we show that such networks can provide promising results and perform better than standard classiiers when the same features are used.</description><identifier>EISSN: 2076-1465</identifier><identifier>EISBN: 9780992862633</identifier><identifier>EISBN: 0992862639</identifier><identifier>DOI: 10.1109/EUSIPCO.2015.7362358</identifier><language>eng</language><publisher>EURASIP</publisher><subject>Artificial neural networks ; audio scene recognition ; Context ; deep belief networks ; Deep neural networks ; Europe ; Mel frequency cepstral coefficient ; Signal processing ; Training</subject><ispartof>2015 23rd European Signal Processing Conference (EUSIPCO), 2015, p.125-129</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c306t-b53c020d3cefe46c890fc302dc4c7ec806b052a0b12ae7ba67b2a8c9d4c1b6373</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7362358$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>310,311,786,790,795,796,27958,54906,55283</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/7362358$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Petetin, Yohan</creatorcontrib><creatorcontrib>Laroche, Cyrille</creatorcontrib><creatorcontrib>Mayoue, Aurelien</creatorcontrib><title>Deep neural networks for audio scene recognition</title><title>2015 23rd European Signal Processing Conference (EUSIPCO)</title><addtitle>EUSIPCO</addtitle><description>These last years, artificial neural networks (ANN) have known a renewed interest since efficient training procedures have emerged to learn the so called deep neural networks (DNN), i.e. ANN with at least two hidden layers. In the same time, the computational auditory scene recognition (CASR) problem which consists in estimating the environment around a device from the received audio signal has been investigated. Most of works which deal with the CASR problem have tried to ind well-adapted features for this problem. However, these features are generally combined with a classical classi-ier. In this paper, we introduce DNN in the CASR ield and we show that such networks can provide promising results and perform better than standard classiiers when the same features are used.</description><subject>Artificial neural networks</subject><subject>audio scene recognition</subject><subject>Context</subject><subject>deep belief networks</subject><subject>Deep neural networks</subject><subject>Europe</subject><subject>Mel frequency cepstral coefficient</subject><subject>Signal processing</subject><subject>Training</subject><issn>2076-1465</issn><isbn>9780992862633</isbn><isbn>0992862639</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2015</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotj9FKwzAUQKMgOGa_QB_6A503uc1N8ih1usFggu55JOmtBGc70g7x7x24p_Nw4MAR4kHCQkpwj8vd-_qt2S4USL0wSAq1vRKFMxacU5YUIV6LmQJDlaxJ34piHFMAhTVKIj0T8Mx8LHs-ZX84Y_oZ8tdYdkMu_alNQzlG7rnMHIfPPk1p6O_ETecPIxcXzsXuZfnRrKrN9nXdPG2qiEBTFTRGUNBi5I5ritZBdzaqjXU0HC1QAK08BKk8m-DJBOVtdG0dZSA0OBf3_93EzPtjTt8-_-4vj_gHjDBGDw</recordid><startdate>201508</startdate><enddate>201508</enddate><creator>Petetin, Yohan</creator><creator>Laroche, Cyrille</creator><creator>Mayoue, Aurelien</creator><general>EURASIP</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201508</creationdate><title>Deep neural networks for audio scene recognition</title><author>Petetin, Yohan ; Laroche, Cyrille ; Mayoue, Aurelien</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c306t-b53c020d3cefe46c890fc302dc4c7ec806b052a0b12ae7ba67b2a8c9d4c1b6373</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Artificial neural networks</topic><topic>audio scene recognition</topic><topic>Context</topic><topic>deep belief networks</topic><topic>Deep neural networks</topic><topic>Europe</topic><topic>Mel frequency cepstral coefficient</topic><topic>Signal processing</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>Petetin, Yohan</creatorcontrib><creatorcontrib>Laroche, Cyrille</creatorcontrib><creatorcontrib>Mayoue, Aurelien</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library Online</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Petetin, Yohan</au><au>Laroche, Cyrille</au><au>Mayoue, Aurelien</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Deep neural networks for audio scene recognition</atitle><btitle>2015 23rd European Signal Processing Conference (EUSIPCO)</btitle><stitle>EUSIPCO</stitle><date>2015-08</date><risdate>2015</risdate><spage>125</spage><epage>129</epage><pages>125-129</pages><eissn>2076-1465</eissn><eisbn>9780992862633</eisbn><eisbn>0992862639</eisbn><abstract>These last years, artificial neural networks (ANN) have known a renewed interest since efficient training procedures have emerged to learn the so called deep neural networks (DNN), i.e. ANN with at least two hidden layers. In the same time, the computational auditory scene recognition (CASR) problem which consists in estimating the environment around a device from the received audio signal has been investigated. Most of works which deal with the CASR problem have tried to ind well-adapted features for this problem. However, these features are generally combined with a classical classi-ier. In this paper, we introduce DNN in the CASR ield and we show that such networks can provide promising results and perform better than standard classiiers when the same features are used.</abstract><pub>EURASIP</pub><doi>10.1109/EUSIPCO.2015.7362358</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | EISSN: 2076-1465 |
ispartof | 2015 23rd European Signal Processing Conference (EUSIPCO), 2015, p.125-129 |
issn | 2076-1465 |
language | eng |
recordid | cdi_ieee_primary_7362358 |
source | IEEE Xplore All Conference Series |
subjects | Artificial neural networks audio scene recognition Context deep belief networks Deep neural networks Europe Mel frequency cepstral coefficient Signal processing Training |
title | Deep neural networks for audio scene recognition |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-09-21T19%3A41%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Deep%20neural%20networks%20for%20audio%20scene%20recognition&rft.btitle=2015%2023rd%20European%20Signal%20Processing%20Conference%20(EUSIPCO)&rft.au=Petetin,%20Yohan&rft.date=2015-08&rft.spage=125&rft.epage=129&rft.pages=125-129&rft.eissn=2076-1465&rft_id=info:doi/10.1109/EUSIPCO.2015.7362358&rft.eisbn=9780992862633&rft.eisbn_list=0992862639&rft_dat=%3Cieee_CHZPO%3E7362358%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c306t-b53c020d3cefe46c890fc302dc4c7ec806b052a0b12ae7ba67b2a8c9d4c1b6373%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=7362358&rfr_iscdi=true |