K2WebWizard

교수소개

교수

김우일

직위(직급): 교수

전화번호: 032-835-8459

이메일: wikim@inu.ac.kr

사이트: http://impress.inu.ac.kr

연구분야: 음성인식, 신호처리, 패턴인식, 인공지능

세부내용

<학력>

고려대학교 전자공학과 공학사

고려대학교 전자공학과 공학석사

고려대학교 전자공학과 공학박사

- 학위 논문: Model-based Feature Compensation for Robust Speech Recognition in Adverse Environments

<경력>

Post-doc Researcher, Carnegie Mellon University, Pittsburgh, PA, USA, 2004년 9월-2005년 8월

Research Associate, University of Texas at Dallas, Richardson, TX, USA, 2005년 9월-2007년 8월

Research Assistant Professor, University of Texas at Dallas, Richardson, TX, USA, 2007년 9월-2012년 8월

인천대학교 컴퓨터공학부 조교수/부교수/교수, 2012년 8월 – 현재

<논문 (Selected)>

DLF-EEND: Dynamic Layer Fusion for End-to-End Speaker Diarization, Interspeech 2025, pp. 1688-1692, Aug. 2025

A study on end-to-end speaker diarization system using single-label classification,Journal of the Acoustical Society of Korea ,제42권(집) ,제6호 ,2023.11.30

A study on speech enhancement using complex-valued spectrum employing Feature map Dependent attention gate,Journal of the Acoustical Society of Korea ,제42권(집) ,제6호 ,PP.544~551 ,2023.11.30

A study on skip-connection with time-frequency self-attention for improving speech enhancement based on complex-valued spectrum,Journal of the Acoustical Society of Korea ,제42권(집) ,제2호 ,PP.94~101 ,2023.03.31

A study on deep neural speech enhancement in drone noise environment,Journal of the Acoustical Society of Korea ,제41권(집) ,제3호 ,PP.342~350 ,2022.05.31

Class-GE2E: Speaker Verification Using Self-Attention and Transfer Learning with Loss Combination,Electronics ,제11권(집) ,제6호 ,2022.03.31

A study on loss combination in time and frequency for effective speech enhancement based on complex-valued spectrum,Journal of the Acoustical Society of Korea ,제41권(집) ,제1호 ,PP.38~44 ,2022.01.31

A study on combination of loss functions for effective mask-based speech enhancement in noisy environments,Journal of the Acoustical Society of Korea ,제40권(집) ,제3호 ,PP.234~240 ,2021.05.31

Speaker Verification Employing Combinations of Self-Attention Mechanisms,Electronics ,제9권(집) ,제12호 ,2020.12.21

Small-Footprint Wake Up Word Recognition in Noisy Environments Employing Competing-Words-Based Feature,Electronics ,제9권(집) ,제12호 ,2020.12.21

A Novel Discriminative Feature Extraction for Acoustic Scene Classification Using RNN Based Source Separation,IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS ,제E100-D권(집) ,제12호 ,PP.3041~3044 ,2017.12.01

DNN Transfer Learning Based Non-Linear Feature Extraction for Acoustic Event Classification,IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS ,제E100-D권(집) ,제9호 ,PP.2249~2252 ,2017.09.01

Relative transfer function (RTF) estimation utilising peaks in time-domain RTF,ELECTRONICS LETTERS ,제52권(집) ,제14호 ,PP.1264~1266 ,2016.07.07

Two-Microphone Generalized Sidelobe Canceller with Post-Filter Based Speech Enhancement in Composite Noise,ETRI JOURNAL ,제38권(집) ,제2호 ,PP.366~375 ,2016.04.01

Advanced parallel combined Gaussian mixture model based feature compensation integrated with iterative channel estimation,SPEECH COMMUNICATION ,제73권(집) ,PP.81~93 ,2015.10.01

A Novel Mask Estimation Method Employing Posterior-Based Representative Mean Estimate for Missing-Feature Speech Recognition,IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING ,제19권(집) ,제5호 ,PP.1434~1443 ,2011.07.01

Variational noise model composition through model perturbation for robust speech recognition with time-varying background noise,SPEECH COMMUNICATION ,제53권(집) ,제4호 ,PP.451~464 ,2011.04.01

Mask Classification for Missing-Feature Reconstruction for Robust Speech Recognition with Unknown Background Noise,SPEECH COMMUNICATION ,제53권(집) ,제1호 ,PP.1~11 ,2011.01.01

Robust Emotional Stressed Speech Detection Using Weighted Frequency Subbands,EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING ,2011.01.01

Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions,IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING ,제18권(집) ,제8호 ,PP.2111~2120 ,2010.11.01

Automatic voice onset time detection for unvoiced stops (/p/,/t/,/k/) with application to accent classification,SPEECH COMMUNICATION ,제52권(집) ,제10호 ,PP.777~789 ,2010.10.01

Phonetic Distance Based Confidence Measure,IEEE SIGNAL PROCESSING LETTERS ,제17권(집) ,제2호 ,PP.117~120 ,2010.02.01