A subspace based progressive coding method for speech compression

Keser, Serkan; Gerek, Ömer Nezih; Seke, Erol; Gillmezoğlu, Mehmet Bilginer

dc.contributor.author	Keser, Serkan
dc.contributor.author	Gerek, Ömer Nezih
dc.contributor.author	Seke, Erol
dc.contributor.author	Gillmezoğlu, Mehmet Bilginer
dc.date.accessioned	2019-10-21T20:11:57Z
dc.date.available	2019-10-21T20:11:57Z
dc.date.issued	2017
dc.identifier.issn	0167-6393
dc.identifier.issn	1872-7182
dc.identifier.uri	https://dx.doi.org/10.1016/j.specom.2017.09.002
dc.identifier.uri	https://hdl.handle.net/11421/20364
dc.description	WOS: 000414819300005	en_US
dc.description.abstract	In this study, two novel methods, which are based on Karhunen Loeve Transform (KLT) and Independent Component Analysis (ICA), are proposed for coding of speech signals. Instead of immediately dealing with eigenvalue magnitudes, the KLT- and ICA-based methods use eigenvectors of covariance matrices (or independent components for ICA) by geometrically grouping these vectors into fewer numbers of vectors. In this way, a data representation compaction is achieved. Further compression is achieved through discarding autocovariance eigenvectors corresponding to the small eigenvalues and applying vector quantization on the remaining eigenvectors. Additionally, this study proposes an iterative error refinement process, which uses the rest of the available bandwidth in order to transmit an efficient representation of the description error for better SNR. The overall process constitutes a new approach to efficient speech coding, with ICA being used in subspace speech coding for the first time. Constant bit rate (CBR) and variable bit rate (VBR) coding algorithms are employed with the proposed methods. TIMIT speech database is used in the experimental studies. Speech signals are synthesized at 2.4 kbps, 8 kbps, 12.2 kbps, 16 kbps, 16.4kbps and 19.85 kbps rates by using various frame lengths. The qualities of synthesized speech signals are compared to those of available speech codecs, i.e., LPC (2.4 kbps), G.728 (LD-CELP, 16 kbps), G.729A (CS-CELP, 8 kbps), EVS (16.4 kbps), AMR-NB (12.2 kbps) and AMR-WB (19.85 kbps)	en_US
dc.language.iso	eng	en_US
dc.publisher	Elsevier Science BV	en_US
dc.relation.isversionof	10.1016/j.specom.2017.09.002	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	Independent Component Analysis (Ica)	en_US
dc.subject	Karhunen Loeve Transform (Klt)	en_US
dc.subject	Speech Codecs	en_US
dc.subject	Subspace Methods	en_US
dc.title	A subspace based progressive coding method for speech compression	en_US
dc.type	article	en_US
dc.relation.journal	Speech Communication	en_US
dc.contributor.department	Anadolu Üniversitesi, Mühendislik Fakültesi, Elektrik ve Elektronik Mühendisliği Bölümü	en_US
dc.identifier.volume	94	en_US
dc.identifier.startpage	50	en_US
dc.identifier.endpage	61	en_US
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı	en_US
dc.contributor.institutionauthor	Gerek, Ömer Nezih

Bu öğenin dosyaları:

Ad:: 20364.pdf
Boyut:: 930.9Kb
Biçim:: PDF
Açıklama:: Tam Metin / Full Text

Göster/Aç

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Makale Koleksiyonu [193]
Scopus İndeksli Yayınlar Koleksiyonu [8325]
Scopus Indexed Publications Collection
WoS İndeksli Yayınlar Koleksiyonu [7605]
WoS Indexed Publications Collection

Basit öğe kaydını göster