교수소개

김우일 사진

교수

김우일
직위(직급)
교수
전화번호
032-835-8459
이메일
wikim@inu.ac.kr
사이트
http://impress.inu.ac.kr
연구분야
음성인식, 신호처리, 패턴인식, 인공지능
세부내용

<학력>

고려대학교 전자공학과 공학사

고려대학교 전자공학과 공학석사

고려대학교 전자공학과 공학박사

- 학위 논문: Model-based Feature Compensation for Robust Speech Recognition in Adverse Environments


<경력>

Post-doc Researcher, Carnegie Mellon University, Pittsburgh, PA, USA, 2004년 9월-2005년 8월

Research Associate, University of Texas at Dallas, Richardson, TX, USA, 2005년 9월-2007년 8월

Research Assistant Professor, University of Texas at Dallas, Richardson, TX, USA, 2007년 9월-2012년 8월

인천대학교 컴퓨터공학부 조교수/부교수/교수, 2012년 8월 – 현재


<논문 (Selected)>

DLF-EEND: Dynamic Layer Fusion for End-to-End Speaker Diarization, Interspeech 2025, pp. 1688-1692, Aug. 2025

A study on end-to-end speaker diarization system using single-label classification,Journal of the Acoustical Society of Korea ,42() ,6,2023.11.30

A study on speech enhancement using complex-valued spectrum employing Feature map Dependent attention gate,Journal of the Acoustical Society of Korea ,42() ,6,PP.544~551 ,2023.11.30

A study on skip-connection with time-frequency self-attention for improving speech enhancement based on complex-valued spectrum,Journal of the Acoustical Society of Korea ,42() ,2,PP.94~101 ,2023.03.31

A study on deep neural speech enhancement in drone noise environment,Journal of the Acoustical Society of Korea ,41() ,3,PP.342~350 ,2022.05.31

Class-GE2E: Speaker Verification Using Self-Attention and Transfer Learning with Loss Combination,Electronics ,11() ,6,2022.03.31

A study on loss combination in time and frequency for effective speech enhancement based on complex-valued spectrum,Journal of the Acoustical Society of Korea ,41() ,1,PP.38~44 ,2022.01.31

A study on combination of loss functions for effective mask-based speech enhancement in noisy environments,Journal of the Acoustical Society of Korea ,40() ,3,PP.234~240 ,2021.05.31

Speaker Verification Employing Combinations of Self-Attention Mechanisms,Electronics ,9() ,12,2020.12.21

Small-Footprint Wake Up Word Recognition in Noisy Environments Employing Competing-Words-Based Feature,Electronics ,9() ,12,2020.12.21

A Novel Discriminative Feature Extraction for Acoustic Scene Classification Using RNN Based Source Separation,IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS ,E100-D() ,12,PP.3041~3044 ,2017.12.01

DNN Transfer Learning Based Non-Linear Feature Extraction for Acoustic Event Classification,IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS ,E100-D() ,9,PP.2249~2252 ,2017.09.01

Relative transfer function (RTF) estimation utilising peaks in time-domain RTF,ELECTRONICS LETTERS ,52() ,14,PP.1264~1266 ,2016.07.07

Two-Microphone Generalized Sidelobe Canceller with Post-Filter Based Speech Enhancement in Composite Noise,ETRI JOURNAL ,38() ,2,PP.366~375 ,2016.04.01

Advanced parallel combined Gaussian mixture model based feature compensation integrated with iterative channel estimation,SPEECH COMMUNICATION ,73() ,PP.81~93 ,2015.10.01

A Novel Mask Estimation Method Employing Posterior-Based Representative Mean Estimate for Missing-Feature Speech Recognition,IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING ,19() ,5,PP.1434~1443 ,2011.07.01

Variational noise model composition through model perturbation for robust speech recognition with time-varying background noise,SPEECH COMMUNICATION ,53() ,4,PP.451~464 ,2011.04.01

Mask Classification for Missing-Feature Reconstruction for Robust Speech Recognition with Unknown Background Noise,SPEECH COMMUNICATION ,53() ,1,PP.1~11 ,2011.01.01

Robust Emotional Stressed Speech Detection Using Weighted Frequency Subbands,EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING ,2011.01.01

Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions,IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING ,18() ,8,PP.2111~2120 ,2010.11.01

Automatic voice onset time detection for unvoiced stops (/p/,/t/,/k/) with application to accent classification,SPEECH COMMUNICATION ,52() ,10,PP.777~789 ,2010.10.01

Phonetic Distance Based Confidence Measure,IEEE SIGNAL PROCESSING LETTERS ,17() ,2,PP.117~120 ,2010.02.01