Electrical Engineering

5282 papers in Electrical Engineering

TITLE

DATE

Multitask learning for frame-level instrument recognition

By Yun-Ning Hung, Yi-An Chen, Yi-Hsuan Yang · ArXiv: 1811.01143

2019-02-19

Deep Segment Attentive Embedding for Duration Robust Speaker Verification

By Bin Liu, Shuai Nie, Yaping Zhang · ArXiv: 1811.00883

2018-11-05

Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks

By Emad M. Grais, Hagen Wierstorf, Dominic Ward · ArXiv: 1811.00454

2019-06-25

Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models

By Herman Kamper · ArXiv: 1811.00403

2019-04-16

End-to-end Models with auditory attention in Multi-channel Keyword Spotting

By Haitong Zhang, Junbo Zhang, Yujun Wang · ArXiv: 1811.00350

2018-11-06

Sequence-to-sequence Models for Small-Footprint Keyword Spotting

By Haitong Zhang, Junbo Zhang, Yujun Wang · ArXiv: 1811.00348

2018-11-02

Neural Music Synthesis for Flexible Timbre Control

By Jong Wook Kim, Rachel Bittner, Aparna Kumar · ArXiv: 1811.00223

2018-11-02

Low-Dimensional Bottleneck Features for On-Device Continuous Speech Recognition

By David B. Ramsay, Kevin Kilgour, Dominik Roblek · ArXiv: 1811.00006

2018-11-02

WaveGlow: A Flow-based Generative Network for Speech Synthesis

By Ryan Prenger, Rafael Valle, Bryan Catanzaro · ArXiv: 1811.00002

2018-11-02

On The Inductive Bias of Words in Acoustics-to-Word Models

By Hao Tang, James Glass · ArXiv: 1810.13407

2018-11-14

Introducing SPAIN (SParse Audio INpainter)

By Ondřej Mokry, Pavel Zaviška, Pavel Rajmic · ArXiv: 1810.13137

2020-01-17

Bi-Directional Lattice Recurrent Neural Networks for Confidence Estimation

By Qiujia Li, Preben Ness, Anton Ragni · ArXiv: 1810.13024

2019-02-19

Scaling Speech Enhancement in Unseen Environments with Noise Embeddings

By Gil Keren, Jing Han, Björn Schuller · ArXiv: 1810.12757

2018-10-31

Hypergraph based semi-supervised learning algorithms applied to speech recognition problem: a novel approach

By Loc Hoang Tran, Trang Hoang, Bui Hoang Nam Huynh · ArXiv: 1810.12743

2018-10-31

Audiovisual speaker conversion: jointly and simultaneously transforming facial expression and acoustic characteristics

By Fuming Fang, Xin Wang, Junichi Yamagishi · ArXiv: 1810.12730

2018-12-04

Feature Trajectory Dynamic Time Warping for Clustering of Speech Segments

By Lerato Lerato, Thomas Niesler · ArXiv: 1810.12722

2018-10-31

Sparse Gaussian Process Audio Source Separation Using Spectrum Priors in the Time-Domain

By Pablo A. Alvarado, Mauricio A. Alvarez, Dan Stowell · ArXiv: 1810.12679

2018-11-22

The Airbus Air Traffic Control speech recognition 2018 challenge: towards ATC automatic transcription and call sign detection

By Thomas Pellegrini, Jerôme Farinas, Estelle Delpech · ArXiv: 1810.12614

2020-03-11

Nonlinear Prediction of Multidimensional Signals via Deep Regression with Applications to Image Coding

By Xi Zhang, Xiaolin Wu · ArXiv: 1810.12568

2018-10-31

Almost-unsupervised Speech Recognition with Close-to-zero Resource Based on Phonetic Structures Learned from Very Small Unpaired Speech and Text Data

By Yi-Chen Chen, Chia-Hao Shen, Sung-Feng Huang · ArXiv: 1810.12566

2018-10-31

In-Silico Proportional-Integral Moment Control of Stochastic Gene Expression

By Corentin Briat, Mustafa Khammash · ArXiv: 1810.12293

2020-04-24

A Proper version of Synthesis-based Sparse Audio Declipper

By Pavel Zaviška, Pavel Rajmic, Ondřej Mokry · ArXiv: 1810.12204

2020-01-17

Audio inpainting of music by means of neural networks

By Andres Marafioti, Nicki Holighaus, Piotr Majdak · ArXiv: 1810.12138

2022-02-21

A Scalable Pipelined Dataflow Accelerator for Object Region Proposals on FPGA Platform

By Wenzhi Fu, Jianlei Yang, Pengcheng Dai · ArXiv: 1810.12137

2018-10-30

An improved hybrid CTC-Attention model for speech recognition

By Zhe Yuan, Zhuoran Lyu, Jiwei Li · ArXiv: 1810.12020

2018-11-02

Improved multipath time delay estimation using cepstrum subtraction

By Eric L. Ferguson, Stefan B. Williams, Craig T. Jin · ArXiv: 1810.11990

2018-10-30

STFT spectral loss for training a neural speech waveform model

By Shinji Takaki, Toru Nakashika, Xin Wang · ArXiv: 1810.11945

2018-10-31

Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection

By Yu-Han Shen, Ke-Xin He, Wei-Qiang Zhang · ArXiv: 1810.11939

2025-05-06

Robust Audio Adversarial Example for a Physical Attack

By Hiromu Yakura, Jun Sakuma · ArXiv: 1810.11793

2019-08-20

Short-segment heart sound classification using an ensemble of deep convolutional neural networks

By Fuad Noman, Chee-Ming Ting, Sh-Hussain Salleh · ArXiv: 1810.11573

2020-04-27
