World Library  

HTML5 Audio

Article Id: WHEBN0034527663
Reproduction Date:

Title: HTML5 Audio  
Author: World Heritage Encyclopedia
Language: English
Subject: Comparison of layout engines (HTML5), HTML, WebGL, Microdata (HTML), Web Messaging
Collection: Digital Audio, Html5
Publisher: World Heritage Encyclopedia

HTML5 Audio

HTML5 Audio is a subject of the HTML5 specification, covering audio input, playback, and synthesis, as well as speech-to-text, in the browser.


  • 1 <audio> element
    • 1.1 Supported browsers
    • 1.2 Supported audio coding formats
  • 2 Web Audio API and MediaStream Processing API
    • 2.1 Supported browsers
  • 3 Web Speech API
    • 3.1 Supported Browsers
  • 4 See also
  • 5 References
  • 6 External links



<audio> element

The <audio> element accepts the following attributes:

  • global attributes (accesskey; class; contenteditable; contextmenu; dir; draggable; dropzone; hidden; id; lang; spellcheck; style; tabindex; title; translate)
  • autoplay = "autoplay" or "" (empty string) or empty
    Instructs the User-Agent to automatically begin playback of the audio stream as soon as it can do so without stopping.
  • preload = "none" or "metadata" or "auto" or "" (empty string) or empty
    Represents a hint to the User-Agent about whether optimistic downloading of the audio stream itself or its metadata is considered worthwhile.
    • "none": Hints to the User-Agent that the user is not expected to need the audio stream, or that minimizing unnecessary traffic is desirable.
    • "metadata": Hints to the User-Agent that the user is not expected to need the audio stream, but that fetching its metadata (duration and so on) is desirable.
    • "auto": Hints to the User-Agent that optimistically downloading the entire audio stream is considered desirable.
  • controls = "controls" or "" (empty string) or empty
    Instructs the User-Agent to expose a user interface for controlling playback of the audio stream.
  • loop = "loop" or "" (empty string) or empty
    Instructs the User-Agent to seek back to the start of the audio stream upon reaching the end.
  • mediagroup = string
    Instructs the User-Agent to link multiple videos and/or audio streams together.
  • muted = "muted" or "" (empty string) or empty
    Represents the default state of the audio stream, potentially overriding user preferences.
  • src = non-empty [URL] potentially surrounded by spaces
    The URL for the audio stream.
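
To make the attribute list more concrete, here is a minimal sketch that renders an <audio> tag from a plain options object. The helper name buildAudioTag and the file name song.ogg are hypothetical, chosen only for illustration; boolean attributes are written in their empty form, which the list above allows.

```javascript
// Hypothetical helper (not part of any specification): renders an <audio>
// tag using the attributes described in the list above.
function buildAudioTag(options) {
  const { src, controls = false, autoplay = false, loop = false, muted = false, preload } = options;
  if (typeof src !== 'string' || src.trim() === '') {
    throw new Error('src must be a non-empty URL');
  }
  // Escape characters that would break out of a double-quoted attribute value.
  const escapeAttr = (value) =>
    String(value).replace(/&/g, '&amp;').replace(/"/g, '&quot;').replace(/</g, '&lt;');

  const parts = [`src="${escapeAttr(src.trim())}"`];
  // Boolean attributes are switched on by their mere presence.
  if (controls) parts.push('controls');
  if (autoplay) parts.push('autoplay');
  if (loop) parts.push('loop');
  if (muted) parts.push('muted');
  if (preload !== undefined) {
    if (!['none', 'metadata', 'auto'].includes(preload)) {
      throw new Error('preload must be "none", "metadata" or "auto"');
    }
    parts.push(`preload="${preload}"`);
  }
  return `<audio ${parts.join(' ')}></audio>`;
}

const markup = buildAudioTag({ src: 'song.ogg', controls: true, preload: 'metadata' });
console.log(markup); // <audio src="song.ogg" controls preload="metadata"></audio>
```

The resulting string asks the User-Agent to show playback controls and to fetch only the metadata up front.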



Supported browsers

Supported audio coding formats

This table documents the current support for audio coding formats by the <audio> element.

Formats supported by different web browsers

| Browser | Operating system | Ogg Vorbis | WAV PCM | MP3 | AAC | WebM Vorbis | Ogg Opus | WebM Opus |
|---|---|---|---|---|---|---|---|---|
| Google Chrome | All supported | 9 | Yes | Yes | Yes | Yes | 25 (since v31 in Windows) | Yes |
| Internet Explorer | Windows | No | No | 9 | 9 | No | No | No |
| Mozilla Firefox | All supported | 3.5 | 3.5 | Windows (21.0), Linux (24.0, needs a gstreamer codec), OS X (26.0) | Windows (21.0), Linux (24.0, needs a gstreamer codec), OS X (34.0) | 4.0 | 15.0 | 28.0[4] |
| Opera | All supported | 10.50 | 11.00 | 14 | 14 | 10.60 | 14 | Yes |
| Safari | OS X | Manual install | 3.1 | 3.1 | 3.1 | No | No | No |

The adoption of HTML5 audio, as with HTML5 video, has become polarized between proponents of free and patented formats. In 2007, the W3C retracted the recommendation to use Vorbis from the specification, together with the recommendation to use Ogg Theora, citing the lack of a format accepted by all the major browser vendors.

Apple and Microsoft support the ISO/IEC-defined formats AAC and the older MP3.

Mozilla and Opera, controlling 24% of the market, support the free and open, royalty-free Vorbis format in Ogg and WebM containers, and criticize the patent-encumbered nature of MP3 and AAC, which are guaranteed to be “non-free”.

Google, controlling 27% of the market, has so far provided support for all common formats.

Gecko-based applications and Safari also support PCM audio in a WAVE container.[5]

In 2012, the free and open, royalty-free Opus format was released and standardized by the IETF. It has been supported by Mozilla’s software since Gecko version 15.[6][7][8]
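
Because no single format is supported everywhere, pages commonly probe the browser before choosing a file. The sketch below assumes the standard canPlayType method of media elements, which answers "", "maybe" or "probably" for a MIME type. The function name pickPlayableSource, the file names, and the stub browser are hypothetical; the probe is passed in as a parameter so the example also runs outside a browser.

```javascript
// Candidate encodings of one (hypothetical) recording, in order of preference.
const candidates = [
  { url: 'clip.opus', type: 'audio/ogg; codecs="opus"' },
  { url: 'clip.ogg', type: 'audio/ogg; codecs="vorbis"' },
  { url: 'clip.m4a', type: 'audio/mp4; codecs="mp4a.40.2"' },
  { url: 'clip.mp3', type: 'audio/mpeg' },
  { url: 'clip.wav', type: 'audio/wav' },
];

// canPlayType is injected; in a page it would be
// (type) => document.createElement('audio').canPlayType(type).
function pickPlayableSource(canPlayType, sources) {
  let fallback = null;
  for (const source of sources) {
    const answer = canPlayType(source.type); // "", "maybe" or "probably"
    if (answer === 'probably') return source;
    // Remember the first "maybe" in case nothing better turns up.
    if (answer === 'maybe' && fallback === null) fallback = source;
  }
  return fallback; // null when nothing is playable
}

// Stub imitating a browser that decodes MP3 and, possibly, WAV.
const mp3Browser = (type) =>
  type === 'audio/mpeg' ? 'probably' : type === 'audio/wav' ? 'maybe' : '';

console.log(pickPlayableSource(mp3Browser, candidates).url); // clip.mp3
```

Listing several source files and letting the browser pick is the declarative alternative; the scripted probe is mainly useful when the choice must be known in code.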

Web Audio API and MediaStream Processing API

The Web Audio API specification developed by W3C describes a high-level JavaScript API for processing and synthesizing audio in web applications. The primary paradigm is of an audio routing graph, where a number of AudioNode objects are connected together to define the overall audio rendering. The actual processing will primarily take place in the underlying implementation (typically optimized Assembly / C / C++ code), but direct JavaScript processing and synthesis is also supported.[9]
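
The routing-graph idea can be sketched as follows: an oscillator node feeds a gain node, which feeds the context's destination. This is a minimal illustration that assumes an object behaving like the specification's AudioContext; the function names buildToneGraph and playBeep, and the chosen frequency and volume, are hypothetical.

```javascript
// Builds the graph oscillator -> gain -> destination on the given context.
// `ctx` is expected to behave like a Web Audio AudioContext.
function buildToneGraph(ctx, frequencyHz, volume) {
  const oscillator = ctx.createOscillator(); // source node
  oscillator.type = 'sine';
  oscillator.frequency.value = frequencyHz;

  const gain = ctx.createGain(); // processing node
  gain.gain.value = volume;

  // Connecting nodes defines the overall audio rendering.
  oscillator.connect(gain);
  gain.connect(ctx.destination);
  return { oscillator, gain };
}

// Browser-only usage sketch; call it from a user gesture such as a click.
function playBeep() {
  const ctx = new AudioContext();
  const { oscillator } = buildToneGraph(ctx, 440, 0.2);
  oscillator.start();
  oscillator.stop(ctx.currentTime + 1); // one second of tone
}
```

Keeping graph construction separate from context creation makes the wiring easy to check with a stand-in context, since the real audio work happens inside the browser's implementation.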

Mozilla's Firefox browser has implemented a similar Audio Data API extension since version 4 (implemented in 2010[10] and released in 2011), but Mozilla warns that it is non-standard and deprecated, and recommends the Web Audio API instead.[11] Some JavaScript audio processing and synthesis libraries, such as Audiolet, support both APIs.

The W3C Audio Working Group is also considering the MediaStream Processing API specification developed by Mozilla.[12] In addition to audio mixing and processing, it covers more general media streaming, including synchronization with HTML elements, capture of audio and video streams, and peer-to-peer routing of such media streams.[13]

Supported browsers

  • PC
  • Mobile
    • Google Chrome for Android 28 (Enabled by default since 29)
    • Mobile Safari 6 (with restrictions on use: muted unless triggered by the user)
    • Mozilla Firefox 23 (Enabled by default since 25)
    • Tizen

Web Speech API

The Web Speech API aims to provide an alternative input method for web applications (without using a keyboard). With this API, developers can give web apps the ability to transcribe the user's voice to text from the computer's microphone. The recorded audio is sent to speech servers for transcription, after which the text is entered for the user. The API itself is agnostic of the underlying speech recognition implementation and can support both server-based and embedded recognizers.[16] The HTML Speech Incubator group has proposed the implementation of audio-speech technology in browsers in the form of uniform, cross-platform APIs. The API contains both:[17]

  • Speech Input API
  • Text to Speech API

Google integrated this feature into Google Chrome in March 2011,[18] letting its users search the web with their voice.
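
As a rough illustration of speech input from script (not the code Google shipped), the sketch below assumes the SpeechRecognition interface of the Web Speech API, which some browsers expose under the prefixed name webkitSpeechRecognition. The function name startDictation and the default language are hypothetical; the constructor is passed in so the example does not depend on a particular browser.

```javascript
// Starts one recognition session and hands the transcript to onText.
// `Recognition` is a constructor such as window.SpeechRecognition or
// window.webkitSpeechRecognition; injecting it is an assumption of this sketch.
function startDictation(Recognition, onText, lang = 'en-US') {
  const recognizer = new Recognition();
  recognizer.lang = lang;
  recognizer.interimResults = false; // report final results only
  recognizer.onresult = (event) => {
    // Each result holds ranked alternatives; index 0 is the most likely one.
    const latest = event.results[event.results.length - 1];
    onText(latest[0].transcript);
  };
  recognizer.start();
  return recognizer;
}

// Browser-only usage sketch:
//   const Ctor = window.SpeechRecognition || window.webkitSpeechRecognition;
//   if (Ctor) startDictation(Ctor, (text) => { searchField.value = text; });
```

Whether the audio is transcribed on a server or on the device is up to the browser; the calling code is the same either way.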

Supported Browsers

See also

References


  1. HTML5 audio element - W3C
  2. HTML/Elements/audio
  3. About HTML5 Audio and Video - Safari HTML5 Audio and Video Guide
  4.
  5.
  6.
  7.
  8.
  9. Chris Rogers (2012-03-15). "Web Audio API".
  10. "Audio Data API".
  11. "Introducing the Audio API extension".
  12. "Audio Processing API".
  13. Robert O'Callahan (2012-05-31). "MediaStream Processing API".
  14. "Web Audio API is now available in Chrome".
  15. Scott Gilbertson (2011-09-19). "Chrome 14 Adds Better Audio, ‘Native Client’ Support".
  16. "API draft". Retrieved January 28, 2012.
  17. "HTML5 Speech API". Retrieved January 28, 2012.
  18. "Talking to your computer". Retrieved January 28, 2012.

External links

  • HTML/Elements/audio - W3C Wiki
  • HTML5 audio element - W3C
  • Web Audio API - W3C
  • MediaStream Processing API - W3C
  • Web Speech API - W3C
  • Web Audio DAW - Github