User Tools

Site Tools


projects:cnn4asr:start

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
projects:cnn4asr:start [2015/02/11 16:27] hjprojects:cnn4asr:start [2015/02/11 16:34] (current) hj
Line 3: Line 3:
 ===== CNNs for Speech Processing ===== ===== CNNs for Speech Processing =====
  
-{{ :projects:cnn4asr:screen_shot_2015-02-11_at_11.27.34_am.png0x250 |}}+{{ :projects:cnn4asr:screen_shot_2015-02-11_at_11.27.34_am.png?0x300 |}}
  
 +\\
 +\\
 +
 +We propose to use convolutional neural networks (CNNs) for speech recognition, where convolution is applied in the frequency domain to normalize speech variations. We further propose a limited-weight-sharing scheme that can better model speech features. The special structure such as local connectivity, weight sharing, and pooling in
 +CNNs exhibits some degree of invariance to small shifts of speech
 +features along the frequency axis, which is important to deal with
 +speaker and environment variations.
 +
 +\\
 +**Reference:** \\
 +
 +[1] O. Abdel-Hamid, A. Mohamed, **H. Jiang**, L. Deng, G. Penn, D. Yu, "[[http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6857341&tag=1|Convolutional Neural Networks for Speech Recognition]]," //IEEE/ACM Trans. on Audio, Speech and Language Processing//, pp.1533-1545, Vol. 22, No. 10,  October 2014. \\ \\
 +[2] O. Abdel-Hamid, A. Mohamed, **H. Jiang**, G. Penn, "Applying Convolutional Neural Networks Concepts to Hybrid NN-HMM Model for Speech Recognition," //Proc. of IEEE International Conference on Acoustic, Speech, Signal Processing (ICASSP'2012)//, Japan, March 2012.
projects/cnn4asr/start.1423672074.txt.gz · Last modified: by hj