Journal
|
2025 |
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition
김민수 IEEE Transactions on Pattern Analysis and Machine Intelligence, v.47, no.2, pp.1042-1055 |
2 |
원문
|
Journal
|
2024 |
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model
여정훈 IEEE Transactions on Multimedia, v.26, pp.6462-6474 |
8 |
원문
|
Journal
|
2024 |
Multimodal audiovisual speech recognition architecture using a three‐feature multi‐fusion method for noise‐robust systems
여도현 ETRI Journal, v.46, no.1, pp.22-34 |
4 |
원문
|
Journal
|
2024 |
Multimodal audiovisual speech recognition architecture using a three‐feature multi‐fusion method for noise‐robust systems
Jeon Sanghun ETRI Journal, v.46, no.1, pp.22-34 |
4 |
원문
|
Journal
|
2024 |
Multimodal audiovisual speech recognition architecture using a three‐feature multi‐fusion method for noise‐robust systems
이지은 ETRI Journal, v.46, no.1, pp.22-34 |
4 |
원문
|
Journal
|
2024 |
AI‐based language tutoring systems with end‐to‐end automatic speech recognition and proficiency evaluation
Kang Byung Ok ETRI Journal, v.46, no.1, pp.48-58 |
10 |
원문
|
Journal
|
2024 |
Spoken‐to‐written text conversion for enhancement of Korean–English readability and machine translation
Choi Hyunjung ETRI Journal, v.46, no.1, pp.127-136 |
3 |
원문
|
Journal
|
2024 |
Alzheimer's Disease Recognition from Spontaneous Speech Using Large Language Models
Bang Jeonguk ETRI Journal, v.46, no.1, pp.96-105 |
7 |
원문
|
Journal
|
2024 |
Joint streaming model for backchannel prediction and automatic speech recognition
최용석 ETRI Journal, v.46, no.1, pp.118-126 |
1 |
원문
|
Conference
|
2023 |
Improving Korean Children's English Speech Recognition Performance via Transfer Learning on Speech Log Data
Yoonhyung Kim 대한전자공학회 학술 대회 (추계) 2023, pp.552-555 |
|
|
Conference
|
2023 |
A Study on the Implementation of User Web/APP Based Reading Ability Assessment Service
Hong Yeon Yu International Conference on Consumer Electronics (ICCE) 2023 : Asia, pp.350-352 |
0 |
원문
|
Journal
|
2023 |
대형 사전훈련 모델의 파인튜닝을 통한강건한 한국어 음성인식 모델 구축
Oh Changhan 말소리와 음성과학, v.15, no.3, pp.75-82 |
|
원문
|
Conference
|
2022 |
An empirical study on semi-supervised transfer learning schemes for out-of-domain application of wav2vec 2.0
Yoonhyung Kim International Congress on Acoustics (ICA) 2022, pp.1-6 |
|
|
Conference
|
2022 |
CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution
김태호 European Conference on Computer Vision (ECCV) 2022 (LNCS 13680), pp.651-667 |
3 |
원문
|
Conference
|
2022 |
End-to-End ASR semi-supervised training using adversarial self-training
Chung Hoon International Congress on Acoustics (ICA) 2022, pp.1-8 |
|
|
Conference
|
2022 |
A Data-Augmented Transfer Learning Method for the Speech Recognition in Domains with Sparse Speech Data
Kang Byung Ok International Congress on Acoustics (ICA) 2022, pp.1-5 |
|
|
Journal
|
2022 |
Artificial Intelligence Applications on Mobile Telecommunication Systems
Yeh Choongil 전자통신동향분석, v.37, no.4, pp.60-69 |
|
원문
|
Journal
|
2022 |
Fast offline transformer‐based end‐to‐end automatic speech recognition for real‐world applications
Yoo Rhee Oh ETRI Journal, v.44, no.3, pp.476-490 |
4 |
원문
|
Conference
|
2021 |
On-device Streaming Transformer-based End-to-End Speech Recognition
Yoo Rhee Oh International Speech Communication Association (INTERSPEECH) 2021, pp.1-2 |
1 |
|
Journal
|
2021 |
Multimodal Unsupervised Speech Translation for Recognizing and Evaluating Second Language Speech
Lee Yun Kyung Applied Sciences, v.11, no.6, pp.1-17 |
4 |
원문
|
Journal
|
2021 |
Phonetic Variation Modeling and a Language Model Adaptation for Korean English Code-Switching Speech Recognition
Lee Damheo Applied Sciences, v.11, no.6, pp.1-14 |
5 |
원문
|
Journal
|
2021 |
Survey of Recent Research in Education based on Artificial Intelligence
Jeon Hyung-Bae 전자통신동향분석, v.36, no.1, pp.71-80 |
|
원문
|
Journal
|
2021 |
Integrating Dilated Convolution into DenseLSTM for Audio Source Separation
허운행 Applied Sciences, v.11, no.2, pp.1-19 |
8 |
원문
|
Journal
|
2020 |
Automatic proficiency assessment of Korean speech read aloud by non‐natives using bidirectional LSTM‐based speech recognition
Yoo Rhee Oh ETRI Journal, v.42, no.5, pp.761-772 |
16 |
원문
|
Journal
|
2020 |
KsponSpeech: Korean Spontaneous Speech Corpus for Automatic Speech Recognition
Bang Jeonguk Applied Sciences, v.10, no.19, pp.1-17 |
44 |
원문
|
Conference
|
2020 |
Multi-Scale Multi-Band Dilated DenseLSTM for Robust Recognition of Speech with Background Music
허운행 International Conference on Information and Communication Technology Convergence (ICTC) 2020, pp.1238-1241 |
0 |
원문
|
Journal
|
2020 |
Speech Recognition for Task Domains with Sparse Matched Training Data
Kang Byung Ok Applied Sciences, v.10, no.18, pp.1-15 |
4 |
원문
|
Journal
|
2020 |
Text-driven Speech Animation with Emotion Control
Chae Won Seok KSII Transactions on Internet and Information Systems, v.14, no.8, pp.3473-3487 |
1 |
원문
|
Conference
|
2020 |
Semi-supervised Training for Sequence-to-Sequence Speech Recognition Using Reinforcement Learning
Chung Hoon International Joint Conference on Neural Networks (IJCNN) 2020, pp.1-6 |
10 |
원문
|
Journal
|
2020 |
Online Speech Recognition Using Multichannel Parallel Acoustic Score Computation and Deep Neural Network (DNN)- Based Voice-Activity Detector
Kiyoung Park Applied Sciences, v.10, no.12, pp.1-21 |
9 |
원문
|
Journal
|
2020 |
Online Speech Recognition Using Multichannel Parallel Acoustic Score Computation and Deep Neural Network (DNN)- Based Voice-Activity Detector
Yoo Rhee Oh Applied Sciences, v.10, no.12, pp.1-21 |
9 |
원문
|
Journal
|
2020 |
Semi-Supervised Speech Recognition Acoustic Model Training Using Policy Gradient
Chung Hoon Applied Sciences, v.10, no.10, pp.1-13 |
4 |
원문
|
Journal
|
2020 |
Honeycomb-like MoS2 Nanotube Array-Based Wearable Sensors for Noninvasive Detection of Human Skin Moisture
Kim Seong Jun ACS Applied Materials & Interfaces, v.12, no.14, pp.17029-17038 |
75 |
원문
|
Journal
|
2020 |
Honeycomb-like MoS2 Nanotube Array-Based Wearable Sensors for Noninvasive Detection of Human Skin Moisture
Shuvra Mondal ACS Applied Materials & Interfaces, v.12, no.14, pp.17029-17038 |
75 |
원문
|
Conference
|
2019 |
A Korean Automatic Speech Recognition for Non-native Speakers by using Bidirectional LSTM-based Acoustic Model with the Augmented Speech Data
Yoo Rhee Oh Seoul International Conference on Speech Sciences (SICSS) 2019, pp.156-156 |
|
|
Conference
|
2019 |
End-to-end Korean Digits Speech Recognition
Noh Jong-Hyouk International Conference on Information and Communication Technology Convergence (ICTC) 2019, pp.1137-1139 |
1 |
원문
|
Conference
|
2019 |
Depth Attention Net
Hyejin S. Kim International Conference on Information and Communication Technology Convergence (ICTC) 2019, pp.1110-1112 |
0 |
원문
|
Conference
|
2019 |
A Preliminary Study on Topical Model for Multi-domain Speech Recognition via Word Embedding Vector
Jihye Moon International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC) 2019, pp.1-4 |
3 |
원문
|
Journal
|
2019 |
Fast Speaker Adaptation using Extended Diagonal Linear Transformation for Deep Neural Networks
Kim Dong Hyun ETRI Journal, v.41, no.1, pp.109-116 |
2 |
원문
|
Conference
|
2018 |
High-Degree Feature for Deep Neural Network based Acoustic Model
Chung Hoon Workshop on Spoken Language Technology (SLT) 2018, pp.1-5 |
0 |
원문
|
Conference
|
2018 |
Hypo and Hyperarticulated Speech Data Augmentation for Spontaneous Speech Recognition
Sung Joo Lee European Signal Processing Conference (EUSIPCO) 2018, pp.2094-2098 |
1 |
원문
|
Conference
|
2018 |
General Labelled Data Generator Framework for Network Machine Learning
Kim Kwihoon International Conference on Advanced Communications Technology (ICACT) 2018, pp.1-5 |
5 |
원문
|
Conference
|
2017 |
Arabic Speech Recognition for Automatic Translation
Yeojeong Kim Conference of the Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA) 2017, pp.262-265 |
|
|
Conference
|
2017 |
Deep-Learning Based Automatic Spontaneous Speech Assessment in a Data-Driven Approach for the 2017 SLaTE CALL Shared Challenge
Yoo Rhee Oh Speech and Language Technology in Education (SLaTE) Workshop 2017, pp.111-116 |
|
|
Journal
|
2017 |
DNN-based Acoustic Modeling for Speech Recognition of Native and Foreign Speakers
Kang Byung Ok 말소리와 음성과학, v.9, no.2, pp.95-101 |
|
원문
|
Journal
|
2016 |
Online Blind Channel Normalization Using BPF-Based Modulation Frequency Filtering
Lee Yun Kyung ETRI Journal, v.38, no.6, pp.1190-1196 |
0 |
원문
|
Conference
|
2016 |
Performance Improvement of the Conversational Speech Recognition System using Deep Neural Network in a Car Navigation System
Woo Yong Choi International Conference on Engineering, Technology, and Applied Science (ICETA) 2016 (Fall), pp.1-7 |
|
|
Conference
|
2016 |
I-vector Based Utterance Verification for Large-Vocabulary Speech Recognition System
Woo Yong Choi International Conference on Computer Communication and the Internet (ICCCI) 2016, pp.316-319 |
4 |
원문
|
Journal
|
2016 |
다국어 자동 통번역을 위한 공통 변환 기반 하이브리드 자동 번역 방법
Choi Sung Kwon 통번역학연구, v.20, no.3, pp.121-136 |
|
|
Journal
|
2016 |
Language Model Adaptation Based on Topic Probability of Latent Dirichlet Allocation
Jeon Hyung-Bae ETRI Journal, v.38, no.3, pp.487-493 |
16 |
원문
|
Journal
|
2016 |
Implementation of CNN in the View of Mini-batch DNN Training for Efficient Second Order Optimization
Hwa Jeon Song 말소리와 음성과학, v.8, no.2, pp.22-30 |
|
원문
|
Journal
|
2016 |
Combining Multiple Acoustic Models in GMM Spaces for Robust Speech Recognition
Kang Byung Ok IEICE Transactions on Information and Systems, v.E99.D, no.3, pp.724-730 |
5 |
원문
|
Conference
|
2016 |
Real-Time Personalized Facial Expression Recognition System based on Deep Learning
Injae Lee International Conference on Consumer Electronics (ICCE) 2016, pp.267-268 |
17 |
원문
|
Conference
|
2015 |
Feature Extraction Method for CD-DNN-HMM Based Speech Recognition Systems
Sung Joo Lee International Conference on Speech Sciences (ICSS) 2015, pp.235-236 |
|
|
Conference
|
2015 |
Error Analysis and Improvements in Korean Atypical Spontaneous Speech Recognition
Kang Byung Ok International Conference on Speech Sciences (ICSS) 2015, pp.241-242 |
|
|
Conference
|
2015 |
The Hardware Accelerator of The Automatic Speech Recognition for The Continuous Korean Words
Kim Ju-Yeob International SoC Design Conference (ISOCC) 2015, pp.1-2 |
|
|
Conference
|
2015 |
A Useful Feature-Engineering Approach for a LVCSR System Based on CD-DNN-HMM Algorithm
Sung Joo Lee European Signal Processing Conference (EUSIPCO) 2015, pp.1436-1440 |
|
|
Journal
|
2015 |
Multimodal Interface Based on Novel HMI UI/UX for In-Vehicle Infotainment System
Jinwoo Kim ETRI Journal, v.37, no.4, pp.793-803 |
25 |
원문
|
Conference
|
2015 |
A Fully-Hardwired Implementation of Large Vocabulary Continuous Speech Recognizer
Yunjoo Kim International Symposium on Consumer Electronics (ISCE) 2015, pp.1-2 |
0 |
원문
|
Conference
|
2015 |
Effective Voice Data Processing Techniques for Speech Recognition in Client/Server Environment
Kim Mi-Kyoung International Conference on Small and Medium Business (ICSMB) 2015, pp.284-287 |
|
|
Conference
|
2014 |
Noise Robust Feature for Automatic Speech Recognition based on Mel-spectrogram Gradient Histogram
Park Tae Jin Workshop on Speech, Language and Audio in Multimedia (SLAM) 2014, pp.1-5 |
|
|
Journal
|
2014 |
Multilingual Speech-to-Speech Translation System for Mobile Consumer Devices
Yun Seung IEEE Transactions on Consumer Electronics, v.60, no.3, pp.508-516 |
21 |
원문
|
Journal
|
2014 |
Intra- and Inter-Frame Features for Automatic Speech Recognition
Sung Joo Lee ETRI Journal, v.36, no.3, pp.514-517 |
14 |
원문
|
Conference
|
2013 |
Noise Robust Spontaneous Speech Recognition Using Multi-Space GMM
Kang Byung Ok International Congress and Exposition on Noise Control Engineering (Inter-Noise) 2013, pp.1-4 |
|
|
Conference
|
2013 |
A Robust Endpoint Detection Algorithm for the Speech Recognition in Noisy Environments
Kiyoung Park International Congress and Exposition on Noise Control Engineering (Inter-Noise) 2013, pp.1-6 |
|
|
Conference
|
2012 |
Performance Improvement of GSC Algorithms by Near Channel Subtraction-Based Blocking Matrix
박상준 International Conference on Ubiquitous Robots and Ambient Intelligence (URAI) 2012, pp.633-635 |
1 |
원문
|
Conference
|
2012 |
Frame-Level Selective Decoding Using Native and Non-native Acoustic Models for Robust Speech Recognition to Native and Non-native Speech
Yoo Rhee Oh International Workshop on Spoken Dialogue Systems (IWSDS) 2012, pp.269-274 |
|
원문
|
Conference
|
2012 |
Robust Speech Recognition based on Emphasis Filtering on Formant Regions in Mobile Noise Environment
Hwa Jeon Song International Congress and Exposition on Noise Control Engineering (Inter Noise) 2012, pp.1-7 |
|
|
Conference
|
2012 |
Lattice Rescoring for Speech Recognition Using Large Scale Distributed Language Models
Euisok Chung International Conference on Computational Linguistics (COLING) 2012, pp.217-224 |
|
|
Journal
|
2012 |
Speech Recognition Based Pronunciation Evaluation Using Pronunciation Variations and Anti-models for Non-native Language Learners
Yoo Rhee Oh Advanced Information Technology in Education, v.126, pp.345-352 |
|
|
Journal
|
2011 |
Efficient Spectrum Estimation of Noise using Line Spectral Pairs for Robust Speech Recognition
장길진 Electronics Letters, v.47, no.25, pp.1399-1401 |
8 |
원문
|
Conference
|
2011 |
Zero-Crossing-Based Channel Attentive Weighting of Cepstral Features for Robust Speech Recognition: The ETRI 2011 CHiME Challenge System
Kim Youngik International Speech Communication Association (INTERSPEECH) 2011, pp.1649-1652 |
|
|
Conference
|
2010 |
Performance Evaluation of Speech Recognition for Indoor Service Robots
Miyoung Cho International Conference on Ubiquitous Robots and Ambient Intelligence (URAI) 2010, pp.468-470 |
|
|
Journal
|
2010 |
Statistical Model-Based Noise Reduction Approach for Car Interior Applications to Speech Recognition
Sung Joo Lee ETRI Journal, v.32, no.5, pp.801-809 |
17 |
원문
|
Journal
|
2010 |
A New Distance Measure for a Variable-Sized Acoustic Model Based on MDL Technique
Cho Hoon-Young ETRI Journal, v.32, no.5, pp.795-800 |
2 |
원문
|
Conference
|
2010 |
SNR-Based Mask Compensation for Computational Auditory Scene Analysis Applied to Speech Recognition in a Car Environment
박지훈 International Speech Communication Association (INTERSPEECH) 2010, pp.725-728 |
|
|
Conference
|
2010 |
A Unified Approach of Compensation and Soft Masking Incorporating a Statistical Model into the Wiener Filter
Kang Byung Ok International Congress on Acoustics (ICA) 2010, pp.1-4 |
|
|
Conference
|
2009 |
Speech Enhancement Using Geometric Source Separation in POMI Robot
Hyejin S. Kim International Conference on Ubiquitous Robots and Ambient Intelligence (URAI) 2009, pp.1-2 |
|
|
Conference
|
2009 |
Intuitive Control Using a Mediated Interface Module
Kang Sang Seung International Conference on Advances in Computer Entertainment Technology (ACE) 2009, pp.435-436 |
1 |
원문
|
Conference
|
2009 |
A Commercial Car Navigation System using Korean Large Vocabulary Automatic Speech Recognizer
Sung Joo Lee Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2009, pp.286-289 |
|
|
Conference
|
2009 |
Word Boundary Unconstraint Viterbi Search For Robust Speech Recognition
Chung Hoon Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2009, pp.627-630 |
|
|
Conference
|
2009 |
Human-Robot Interface Using Robust Speech Recognition and User Localization Based on Noise Separation Device
Kiyoung Park International Symposium on Robot and Human Interactive Communication (RO-MAN) 2009, pp.328-333 |
5 |
원문
|
Conference
|
2009 |
Fast Speech Recognition for Voice Destination Entry in a Car Navigation System
Chung Hoon International Speech Communication Association (INTERSPEECH) 2009, pp.975-978 |
|
|
Conference
|
2009 |
An Overview of Korean-English Speech-to-Speech Translation System
Ilbin Lee Workshop on Technologies and Corpora for Asia-Pacific Speech Translation (TCAST) 2009, pp.1-4 |
|
|
Conference
|
2008 |
Multi-Modal Fusion of Speech-Gesture Using Integrated Probability Density Distribution
Lee Jigeun International Symposium on Intelligent Information Technology Application (IITA) 2008, pp.361-364 |
0 |
원문
|
Conference
|
2008 |
Development of Recognition System Using Fusion of Natural Gesture/Speech
Jung Young-Giu International Conference on Consumer Electronics (ICCE) 2008, pp.1-2 |
3 |
원문
|
Conference
|
2008 |
Using Confidence Vector in Multi-Stage Speech Recognition
Jeon Hyung-Bae International Joint Conference on Natural Language Processing (IJCNLP) 2008, pp.1-5 |
|
|
Conference
|
2007 |
Network-based Voice Component FrameWork for Human Robot Interaction
Hyejin S. Kim International Symposium on Communications and Information Technologies (ISCIT) 2007, pp.1546-1550 |
1 |
원문
|
Journal
|
2007 |
Multi-stage Speech Recognition Using Confidence Vector
Jeon Hyung-Bae 대한음성학회지 : 말소리, v.63, pp.113-124 |
|
|
Conference
|
2007 |
Preventing an External Acoustic Noise from being Misrecognized as a Speech Recognition Object by Confirming the Lip Movement Image Signal
Soo-Jong Lee International Speech Communication Association (INTERSPEECH) 2007, pp.718-721 |
|
|
Conference
|
2007 |
Speech Activity Detection with Lip Movement Image Signals
Soo-Jong Lee Pacific Rim Conference on Communications, Computers and signal Processing (PACRIM) 2007, pp.403-406 |
1 |
원문
|
Conference
|
2007 |
Discriminative Noise Adaptive Training Approach for an Environment Migration
Kang Byung Ok International Speech Communication Association (INTERSPEECH) 2007, pp.2085-2088 |
|
|
Conference
|
2007 |
A Case study of Edutainment Robot: Applying Voice Question Answering to Intelligent Robot
Oh Hyo-Jung International Symposium on Robot and Human Interactive Communication (RO-MAN) 2007, pp.410-415 |
7 |
원문
|
Conference
|
2007 |
Speaker Identification and Verification for Intelligent Service Robots
Keun-Chang Kwak International Conference on Artifical Intelligence (ICAI) 2007, pp.1-5 |
|
|
Journal
|
2007 |
Adaptive Channel Normalization Based on Infomax Algorithm for Robust Speech Recognition
Jung Ho Young ETRI Journal, v.29, no.3, pp.300-304 |
1 |
원문
|
Journal
|
2007 |
Development of an Optimized Feature Extraction Algorithm for Throat Signal Analysis
Jung Young-Giu ETRI Journal, v.29, no.3, pp.292-299 |
9 |
원문
|
Conference
|
2007 |
Performance Analysis of Adaptive Interlocutory and Noisy Signal through IP Networks
Kim Jin Sul AIAA International Communications Satellite Systems Conference (ICSSC) 2007, pp.1-9 |
0 |
|
Conference
|
2007 |
Data-Driven Subvector Clustering using the Cross-Entropy Method
정규준 International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2007, pp.IV977-IV980 |
3 |
원문
|
Journal
|
2006 |
Frame Reliability Weighting for Robust Recognition of Partially Corrupted Speech
Cho Hoon-Young Electronics Letters, v.42, no.25, pp.1487-1488 |
0 |
원문
|
Conference
|
2006 |
Speech-based Human-Robot Interaction Components for URC Intelligent Service Robots
Keun-Chang Kwak IEEE/RSJ International Conference on Intelligent Robots and Systems 2006, pp.1-1 |
|
원문
|
Conference
|
2006 |
Speaker's Gender Identification For Human Robot Interaction
Bae Kyung Sook International Conference on Signal Processing and Multimedia Applications (SIGMAP) 2006, pp.339-342 |
|
|
Journal
|
2006 |
Memory Efficient and Fast Speech Recognition System for LowResource Mobile Devices
Chung Hoon IEEE Transactions on Consumer Electronics, v.52, no.3, pp.792-796 |
8 |
원문
|
Conference
|
2006 |
A Development of Human-Robot Interaction Components for URC Intelligent Service Robots
Keun-Chang Kwak IEEE International Conference on Robotics and Automation (ICRA) 2006, pp.1-3 |
|
|
Journal
|
2004 |
Filtering of Filter-Bank Energies for Robust Speech Recognition
Jung Ho Young ETRI Journal, v.26, no.3, pp.273-276 |
7 |
원문
|
Journal
|
2004 |
Feature compensation based on soft decision
김남수 IEEE Signal Processing Letters, v.11, no.3, pp.378-381 |
10 |
원문
|
Conference
|
2001 |
Content-Based News Video Retrieval with Closed Captions and Time Alignment
222001 Pacific Rim Conference on Multimedia (PCM) 2001 (LNCS 2195), pp.879-884 |
0 |
원문
|