World Library  
Flag as Inappropriate
Email this Article

Codec2

Article Id: WHEBN0035911564
Reproduction Date:

Title: Codec2  
Author: World Heritage Encyclopedia
Language: English
Subject: Multi-Band Excitation, Comparison of audio coding formats, CSipSimple, Speech coding, FreeSWITCH
Collection: Free Audio Codecs, Speech Codecs
Publisher: World Heritage Encyclopedia
Publication
Date:
 

Codec2

Codec2 is a low-bitrate speech audio codec (speech coding) that is patent free and open source.[1] Codec2 compresses speech using sinusoidal coding, a method specialized for human speech. Bit rates of 3200 to 1200 bit/s have been successfully created. Codec2 was designed to be used for amateur radio and other high compression voice applications.

Contents

  • Overview 1
  • History 2
  • References 3
  • External links 4

Overview

Codec2 uses sinusoidal coding to model speech. In sinusoidal coding, spoken audio is recreated by modeling speech as a sum of harmonically related sine waves with independent amplitudes called Line spectral pairs, or LSP. The fundamental frequency of the speaker's voice (pitch) and the amplitude (energy) of the harmonics is encoded and with the LSP's are exchanged across a channel in a digital format. The LSP coefficients represent the Linear Predictive Coding (LPC) model in the frequency domain, and lend themselves to a robust and efficient quantisation of the LPC parameters.[2]

Codec2 consists of 3200, 2400, 1600, 1400, 1300, and 1200 bit/s codec modes. It outperforms most other low-bitrate speech codecs. For example, it uses half the bandwidth of Advanced Multi-Band Excitation to encode speech with similar quality. The speech codec uses PCM sampled audio, and outputs encoded digital bytes. Likewise, you send it encoded digital bytes, and it outputs PCM sampled audio. The audio sample rate is always 8 kHz. Internally, the codec algorithms operate on 10 ms PCM frames, with each of these segments declared voiced (vowel) or unvoiced (consonant).

The digital bytes output are in a packed bit-field format. These bits are also Gray coded before being grouped together. The gray coding might be useful if sent raw, but usually an application will just burst the fields out. The bit-fields make-up the various parameters that are stored or exchanged (pitch, energy, voicing booleans, LSP's, etc).

For example, Mode 3200, has 20 ms of audio converted to 64 Bits. So 64 Bits will be output every 20 ms (50 times a second), for a minimum data rate of 3200 bits/sec. These 64 bits are sent as 8 bytes to the application, which has to unwrap the bit-fields, or send the bytes on a data channel.

Another example is Mode 1300, which is sent 40 ms of audio, and outputs 52 Bits every 40 ms (25 times a second), for a minimum rate of 1300 bits/sec. These 52 bits are sent as 7 bytes to the application or data channel.

The codec was developed by Ph.D. David Rowe (Amateur Radio Call-Sign VK5DGR), with support and cooperation of other researchers (e.g., Jean-Marc Valin from Speex).[3] The codec software is open source and is freely available in a subversion (SVN) repository.[4] The source code is released under LGPL Version 2.[5] It has been tested on Linux and MS Windows.

The codec has been presented in various conferences and has received the 2012 ARRL Technical Innovation Award,[6] and the Linux Australia Conference's Best Presentation Award.[7]

History

Open source evangelist and radio amateur Bruce Perens lobbied for the creation of an alternative, open-source audio codec. Perens did not have the speech-processing background to do the programming himself, and was introduced to speech-coding scientist David Rowe by Jean-Marc Valin. Rowe eventually agreed to take on the project.[8] It has now been fully realized with the creation of Codec2.[9] Rowe has also created a frequency-division multiplex (FDM) soft-modem which carries the digital voice (DV) in only 1.3 kHz of radio bandwidth.[10] The codec and FDM modem are used every day on amateur radio shortwave bands.

Before Codec2, there were no similarly licensed, patent free, low bitrate audio codecs.

References

  1. ^ "DCC2011-Codec2-VK5DGR". 
  2. ^ "Techniques for Harmonic Sinusoidal Coding". 
  3. ^ "A Pitch-Energy Quantizer for Codec2". 
  4. ^ "Repository for Codec2 Source". 
  5. ^ "Codec2 - an Open Source, Low-Bandwidth Voice Codec - Slashdot". 
  6. ^ ARRL Technical Innovation Award in 2012
  7. ^ Linux Australia 2012 conference
  8. ^ "Open Source Low Rate Speech Codec Part 1". 
  9. ^ "Codec2 V0.1 Alpha Released". 
  10. ^ "FDMDV Modem". 

External links

  • www.rowetel.com/codec2.html
  • Various Speech Coding Links
  • FreeDV
This article was sourced from Creative Commons Attribution-ShareAlike License; additional terms may apply. World Heritage Encyclopedia content is assembled from numerous content providers, Open Access Publishing, and in compliance with The Fair Access to Science and Technology Research Act (FASTR), Wikimedia Foundation, Inc., Public Library of Science, The Encyclopedia of Life, Open Book Publishers (OBP), PubMed, U.S. National Library of Medicine, National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health (NIH), U.S. Department of Health & Human Services, and USA.gov, which sources content from all federal, state, local, tribal, and territorial government publication portals (.gov, .mil, .edu). Funding for USA.gov and content contributors is made possible from the U.S. Congress, E-Government Act of 2002.
 
Crowd sourced content that is contributed to World Heritage Encyclopedia is peer reviewed and edited by our editorial staff to ensure quality scholarly research articles.
 
By using this site, you agree to the Terms of Use and Privacy Policy. World Heritage Encyclopedia™ is a registered trademark of the World Public Library Association, a non-profit organization.
 


Copyright © World Library Foundation. All rights reserved. eBooks from Project Gutenberg are sponsored by the World Library Foundation,
a 501c(4) Member's Support Non-Profit Organization, and is NOT affiliated with any governmental agency or department.