Electrical Engineering
5282 papers in Electrical Engineering
TITLE
DATE
VIEWS
Multitask learning for frame-level instrument recognition
By Yun-Ning Hung, Yi-An Chen, Yi-Hsuan Yang · ArXiv: 1811.01143
2019-02-19
0
Deep Segment Attentive Embedding for Duration Robust Speaker Verification
By Bin Liu, Shuai Nie, Yaping Zhang · ArXiv: 1811.00883
2018-11-05
0
Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks
By Emad M. Grais, Hagen Wierstorf, Dominic Ward · ArXiv: 1811.00454
2019-06-25
1
Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models
By Herman Kamper · ArXiv: 1811.00403
2019-04-16
1
End-to-end Models with auditory attention in Multi-channel Keyword Spotting
By Haitong Zhang, Junbo Zhang, Yujun Wang · ArXiv: 1811.00350
2018-11-06
0
Sequence-to-sequence Models for Small-Footprint Keyword Spotting
By Haitong Zhang, Junbo Zhang, Yujun Wang · ArXiv: 1811.00348
2018-11-02
0
Neural Music Synthesis for Flexible Timbre Control
By Jong Wook Kim, Rachel Bittner, Aparna Kumar · ArXiv: 1811.00223
2018-11-02
0
Low-Dimensional Bottleneck Features for On-Device Continuous Speech Recognition
By David B. Ramsay, Kevin Kilgour, Dominik Roblek · ArXiv: 1811.00006
2018-11-02
0
WaveGlow: A Flow-based Generative Network for Speech Synthesis
By Ryan Prenger, Rafael Valle, Bryan Catanzaro · ArXiv: 1811.00002
2018-11-02
0
On The Inductive Bias of Words in Acoustics-to-Word Models
By Hao Tang, James Glass · ArXiv: 1810.13407
2018-11-14
6
Introducing SPAIN (SParse Audio INpainter)
By Ondřej Mokry, Pavel Zaviška, Pavel Rajmic · ArXiv: 1810.13137
2020-01-17
0
Bi-Directional Lattice Recurrent Neural Networks for Confidence Estimation
By Qiujia Li, Preben Ness, Anton Ragni · ArXiv: 1810.13024
2019-02-19
0
Scaling Speech Enhancement in Unseen Environments with Noise Embeddings
By Gil Keren, Jing Han, Björn Schuller · ArXiv: 1810.12757
2018-10-31
0
Hypergraph based semi-supervised learning algorithms applied to speech recognition problem: a novel approach
By Loc Hoang Tran, Trang Hoang, Bui Hoang Nam Huynh · ArXiv: 1810.12743
2018-10-31
0
Audiovisual speaker conversion: jointly and simultaneously transforming facial expression and acoustic characteristics
By Fuming Fang, Xin Wang, Junichi Yamagishi · ArXiv: 1810.12730
2018-12-04
0
Feature Trajectory Dynamic Time Warping for Clustering of Speech Segments
By Lerato Lerato, Thomas Niesler · ArXiv: 1810.12722
2018-10-31
0
Sparse Gaussian Process Audio Source Separation Using Spectrum Priors in the Time-Domain
By Pablo A. Alvarado, Mauricio A. Alvarez, Dan Stowell · ArXiv: 1810.12679
2018-11-22
0
The Airbus Air Traffic Control speech recognition 2018 challenge: towards ATC automatic transcription and call sign detection
By Thomas Pellegrini, Jérôme Farinas, Estelle Delpech · ArXiv: 1810.12614
2020-03-11
0
Nonlinear Prediction of Multidimensional Signals via Deep Regression with Applications to Image Coding
By Xi Zhang, Xiaolin Wu · ArXiv: 1810.12568
2018-10-31
0
Almost-unsupervised Speech Recognition with Close-to-zero Resource Based on Phonetic Structures Learned from Very Small Unpaired Speech and Text Data
By Yi-Chen Chen, Chia-Hao Shen, Sung-Feng Huang · ArXiv: 1810.12566
2018-10-31
0
In-Silico Proportional-Integral Moment Control of Stochastic Gene Expression
By Corentin Briat, Mustafa Khammash · ArXiv: 1810.12293
2020-04-24
0
A Proper version of Synthesis-based Sparse Audio Declipper
By Pavel Zaviška, Pavel Rajmic, Ondřej Mokry · ArXiv: 1810.12204
2020-01-17
0
Audio inpainting of music by means of neural networks
By Andres Marafioti, Nicki Holighaus, Piotr Majdak · ArXiv: 1810.12138
2022-02-21
1
A Scalable Pipelined Dataflow Accelerator for Object Region Proposals on FPGA Platform
By Wenzhi Fu, Jianlei Yang, Pengcheng Dai · ArXiv: 1810.12137
2018-10-30
0
An improved hybrid CTC-Attention model for speech recognition
By Zhe Yuan, Zhuoran Lyu, Jiwei Li · ArXiv: 1810.12020
2018-11-02
0
Improved multipath time delay estimation using cepstrum subtraction
By Eric L. Ferguson, Stefan B. Williams, Craig T. Jin · ArXiv: 1810.11990
2018-10-30
0
STFT spectral loss for training a neural speech waveform model
By Shinji Takaki, Toru Nakashika, Xin Wang · ArXiv: 1810.11945
2018-10-31
1
Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection
By Yu-Han Shen, Ke-Xin He, Wei-Qiang Zhang · ArXiv: 1810.11939
2025-05-06
2
Robust Audio Adversarial Example for a Physical Attack
By Hiromu Yakura, Jun Sakuma · ArXiv: 1810.11793
2019-08-20
0
Short-segment heart sound classification using an ensemble of deep convolutional neural networks
By Fuad Noman, Chee-Ming Ting, Sh-Hussain Salleh · ArXiv: 1810.11573
2020-04-27
0