TY - JOUR AU - You, S. D. AU - Wu, Y.-C. AU - Peng, S.-H. PY - 2016 DA - 2016// TI - Comparative study of singing voice detection methods JO - Multimedia Tools Appl VL - 75 UR - https://doi.org/10.1007/s11042-015-2894-9 DO - 10.1007/s11042-015-2894-9 ID - You2016 ER - TY - JOUR AU - Hsu, C.-L. AU - Wang, D. AU - Jang, J. S. R. AU - Hu, K. PY - 2012 DA - 2012// TI - A tandem algorithm for singing pitch extraction and voice separation from music accompaniment JO - IEEE Trans Audio Speech Lang Process VL - 20 UR - https://doi.org/10.1109/TASL.2011.2182510 DO - 10.1109/TASL.2011.2182510 ID - Hsu2012 ER - TY - STD TI - Logan B, Chu S (2000) Music summarization using key phrases. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing, 2000 ID - ref3 ER - TY - JOUR AU - Salamon, J. AU - Gómez, E. AU - Ellis, D. P. AU - Richard, G. PY - 2014 DA - 2014// TI - Melody extraction from polyphonic music signals: approaches, applications, and challenges JO - IEEE Signal Process Mag VL - 31 UR - https://doi.org/10.1109/MSP.2013.2271648 DO - 10.1109/MSP.2013.2271648 ID - Salamon2014 ER - TY - STD TI - Kim Y E, Whitman B (2002) Singer identification in popular music recordings using voice coding features. In: Proceedings of the 3rd international conference on music information retrieval, 2002 ID - ref5 ER - TY - STD TI - Berenzweig AL, Ellis DP (2001) Locating singing voice segments within music signals. In: IEEE workshop on the applications of signal processing to audio and acoustics, 2001 ID - ref6 ER - TY - STD TI - Lukashevich H, Gruhne M, Dittmar C (2007) Effective singing voice detection in popular music using ARMA filtering. In: Workshop on Digital Audio Effects (DAFx’07), 2007 ID - ref7 ER - TY - JOUR AU - Song, Y. AU - Kim, I.
PY - 2018 DA - 2018// TI - DeepAct: a deep neural network model for activity detection in untrimmed videos JO - J Inform Process Syst VL - 14 UR - https://doi.org/10.3745/JIPS.04.0059 DO - 10.3745/JIPS.04.0059 ID - Song2018 ER - TY - JOUR AU - Yu, N. AU - Yu, Z. AU - Gu, F. AU - Li, T. AU - Tian, X. AU - Pan, Y. PY - 2017 DA - 2017// TI - Deep learning in genomic and medical image data analysis: challenges and approaches JO - J Inform Process Syst VL - 13 UR - https://doi.org/10.3745/JIPS.04.0029 DO - 10.3745/JIPS.04.0029 ID - Yu2017 ER - TY - STD TI - Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Advances in neural information processing systems, 2012 ID - ref10 ER - TY - JOUR AU - Koo, K. M. AU - Cha, E. Y. PY - 2017 DA - 2017// TI - Image recognition performance enhancements using image normalization JO - Human-centric Comput Inform Sci VL - 7 UR - https://doi.org/10.1186/s13673-017-0114-5 DO - 10.1186/s13673-017-0114-5 ID - Koo2017 ER - TY - JOUR AU - Davis, S. B. AU - Mermelstein, P. PY - 1980 DA - 1980// TI - Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences JO - IEEE Trans Acoust Speech Signal Process VL - 28 UR - https://doi.org/10.1109/TASSP.1980.1163420 DO - 10.1109/TASSP.1980.1163420 ID - Davis1980 ER - TY - STD TI - Dieleman S, Schrauwen B (2014) End-to-end learning for music audio. In: IEEE international conference on acoustics, speech and signal processing, 2014 ID - ref13 ER - TY - STD TI - Dai J, Liang S, Xue W, Ni C, Liu W (2016) Long short-term memory recurrent neural network based segment features for music genre classification. In: 10th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2016 ID - ref14 ER - TY - STD TI - Shi X, Chen Z, Wang H, Yeung D Y, Wong W K, Woo W C (2015) Convolutional LSTM network: A machine learning approach for precipitation nowcasting.
In: Advances in neural information processing systems, 2015 ID - ref15 ER - TY - STD TI - Sabour S, Frosst N, Hinton GE (2017) Dynamic routing between capsules. In: Advances in neural information processing systems, 2017 ID - ref16 ER - TY - STD TI - Kim H G, Sikora T (2004) Comparison of MPEG-7 audio spectrum projection features and MFCC applied to speaker recognition, sound classification and audio segmentation. In: IEEE international conference on acoustics, speech, and signal processing, 2004 ID - ref17 ER - TY - STD TI - Rocamora M, Herrera P (2007) Comparing audio descriptors for singing voice detection in music audio files. In: 11th Brazilian symposium on computer music, San Pablo, Brazil, 2007 ID - ref18 ER - TY - STD TI - Dittmar C, Lehner B, Prätzlich T, Müller M, Widmer G (2015) Cross-version singing voice detection in classical opera recordings. In: International society for music information retrieval conference (ISMIR), Malaga, Spain, 2015 ID - ref19 ER - TY - STD TI - Vembu S, Baumann S (2005) Separation of vocals from polyphonic audio recordings. In: 6th international conference on music information retrieval (ISMIR 2005), London, 2005 ID - ref20 ER - TY - STD TI - Nwe T L, Shenoy A, Wang Y (2004) Singing voice detection in popular music. In: Proceedings of the 12th annual ACM international conference on Multimedia, 2004 ID - ref21 ER - TY - STD TI - Leglaive S, Hennequin R, Badeau R (2015) Singing voice detection with deep recurrent neural networks. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015 ID - ref22 ER - TY - STD TI - Schlüter J, Grill T (2015) Exploring Data Augmentation for Improved Singing Voice Detection with Neural Networks. In: Proc. of the 16th International Society for Music Information Retrieval Conference, 2015 ID - ref23 ER - TY - STD TI - Humphrey E J, Bello J P, LeCun Y (2012) Moving Beyond Feature Design: Deep Architectures and Automatic Feature Learning in Music Informatics. 
In: Proceedings of the 13th International Society for Music Information Retrieval Conference, Porto, Portugal, 2012 ID - ref24 ER - TY - JOUR AU - Lee, J. AU - Park, J. AU - Kim, K. L. AU - Nam, J. PY - 2018 DA - 2018// TI - SampleCNN: end-to-end deep convolutional neural networks using very small filters for music classification JO - Appl Sci VL - 8 UR - https://doi.org/10.3390/app8010150 DO - 10.3390/app8010150 ID - Lee2018 ER - TY - JOUR AU - Lim, M. AU - Lee, D. AU - Park, H. AU - Kang, Y. AU - Oh, J. AU - Park, J. S. AU - Kim, J. H. PY - 2018 DA - 2018// TI - Convolutional neural network based audio event classification JO - KSII Trans Internet Inform Syst VL - 12 ID - Lim2018 ER - TY - STD TI - Huang HM, Chen WK, Liu CH, You SD (2018) Singing voice detection based on convolutional neural networks. In: 2018 7th international symposium on next generation electronics, Taipei, 2018 ID - ref27 ER - TY - STD TI - Wu Y C, Chang P C, Wang C Y, Wang J C (2017) A symmetric kernel convolutional neural network for acoustic scenes classification. In: IEEE international symposium on consumer electronics, Kuala Lumpur, Malaysia, 2017 ID - ref28 ER - TY - STD TI - Available: https://en.wikipedia.org/wiki/Spectrogram. Accessed 6 Sep 2018 UR - https://en.wikipedia.org/wiki/Spectrogram ID - ref29 ER - TY - STD TI - Ramona M, Richard G, David B (2008) Vocal detection in music with support vector machines. In: IEEE international conference on acoustics, speech and signal processing, 2008 ID - ref30 ER - TY - STD TI - Defferrard M, Benzi K, Vandergheynst P, Bresson X (2017) FMA: a dataset for music analysis. In: 18th international society for music information retrieval conference, 2017 ID - ref31 ER - TY - STD TI - Available: https://github.com/NTUT-LabASPL/FMA-C-DataSet-for-Vocal-Detection. Accessed 18 Oct 2018 UR - https://github.com/NTUT-LabASPL/FMA-C-DataSet-for-Vocal-Detection ID - ref32 ER - TY - STD TI - Available: https://www.tensorflow.org/.
Accessed 6 Sep 2018 UR - https://www.tensorflow.org/ ID - ref33 ER - TY - STD TI - Available: https://keras.io/. Accessed 6 Sep 2018 UR - https://keras.io/ ID - ref34 ER - TY - STD TI - Zeiler MD (2012) ADADELTA: an adaptive learning rate method. In: arXiv preprint arXiv:1212.5701 UR - http://arxiv.org/abs/1212.5701 ID - ref35 ER - TY - JOUR AU - Srivastava, N. AU - Hinton, G. AU - Krizhevsky, A. AU - Sutskever, I. AU - Salakhutdinov, R. PY - 2014 DA - 2014// TI - Dropout: a simple way to prevent neural networks from overfitting JO - J Mach Learn Res VL - 15 ID - Srivastava2014 ER -