<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Off the Clock</title>
	<atom:link href="http://www.octasic.com/blog/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.octasic.com/blog</link>
	<description></description>
	<lastBuildDate>Fri, 10 May 2013 14:54:45 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Small-cell SoCs may be just what military broadband systems are looking for</title>
		<link>http://www.octasic.com/blog/2013/05/09/small-cell-socs-may-be-just-what-military-broadband-systems-are-looking-for/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=small-cell-socs-may-be-just-what-military-broadband-systems-are-looking-for</link>
		<comments>http://www.octasic.com/blog/2013/05/09/small-cell-socs-may-be-just-what-military-broadband-systems-are-looking-for/#comments</comments>
		<pubDate>Thu, 09 May 2013 20:28:31 +0000</pubDate>
		<dc:creator>Emmanuel Gresset</dc:creator>
				<category><![CDATA[Baseband processing]]></category>
		<category><![CDATA[Wireless]]></category>
		<category><![CDATA[2g]]></category>
		<category><![CDATA[3G]]></category>
		<category><![CDATA[4G]]></category>
		<category><![CDATA[advanced military radios]]></category>
		<category><![CDATA[broadband]]></category>
		<category><![CDATA[civilian]]></category>
		<category><![CDATA[gsm]]></category>
		<category><![CDATA[hspa]]></category>
		<category><![CDATA[lte]]></category>
		<category><![CDATA[military]]></category>
		<category><![CDATA[small cell basestations]]></category>
		<category><![CDATA[small cells]]></category>

		<guid isPermaLink="false">http://www.octasic.com/blog/?p=447</guid>
		<description><![CDATA[The use of small cells by mobile operators has been getting a lot of buzz lately. One thing that’s fallen below the radar (no pun intended) is the use of small cells in military systems. In the world of military broadband wireless networks, ambitions have risen in parallel with the ubiquitous deployment of civilian broadband wireless networks. However, the development of advanced military radios has been scaled back in some cases, and manufacturers are under pressure to deliver affordable solutions, quickly and through low-risk programs. To deliver the broadband performance the military needs, at a price governments can afford, some manufacturers have integrated advanced commercial Systems-on-a-Chip (SoCs) designed for civilian small-cell base stations into their military software defined radio (SDR) systems.

Defense OEMs’ pivot towards COTS SDR technologies is understandable – the performance of the latest civilian base station SoCs is spectacular. Not only do these systems implement multiple air interface standards (2G/GSM, 3G/HSPA+, as well as 4G/LTE) and offer very high end-user bitrates (over 10 Mbps), but they also implement advanced adaptive antenna techniques such as 2X2 and 4X4 MIMO. Although small-cell SoCs incorporate many of the features required by tactical radio designers, some SoCs are better suited to military applications than others. In this article, I’ll highlight these differences.]]></description>
				<content:encoded><![CDATA[<p>The use of small cells by mobile operators has been getting a lot of buzz lately. One thing that’s fallen below the radar (no pun intended) is the use of small cells in military systems. In the world of military broadband wireless networks, ambitions have risen in parallel with the ubiquitous deployment of civilian broadband wireless networks. However, the development of advanced military radios has been scaled back in some cases, and manufacturers are under pressure to deliver affordable solutions, quickly and through low-risk programs. To deliver the broadband performance the military needs, at a price governments can afford, some manufacturers have integrated advanced commercial Systems-on-a-Chip (SoCs) designed for civilian small-cell base stations into their military software defined radio (SDR) systems.</p>
<p><b>High performance commercial small-cell SoCs</b></p>
<p>Defense OEMs’ pivot towards COTS SDR technologies is understandable – the performance of the latest civilian base station SoCs is spectacular. Not only do these systems implement multiple air interface standards (2G/GSM, 3G/HSPA+, as well as 4G/LTE) and offer very high end-user bitrates (over 10 Mbps), but they also implement advanced adaptive antenna techniques such as 2X2 and 4X4 MIMO. Although small-cell SoCs incorporate many of the features required by tactical radio designers, some SoCs are better suited to military applications than others. In this article, I’ll highlight these differences.</p>
<p><b>The exacting demands of military radio</b></p>
<p>Front-line users of military radios have a long list of critical requirements, including:</p>
<ul>
<li>Low power consumption and long battery life</li>
<li>Small size and low weight</li>
<li>High reliability under adverse conditions</li>
<li>Long range</li>
<li>Frequency agility and adaptability to congested spectrum environments</li>
<li>High bandwidth (to the individual user)</li>
<li>Support for on-the-move operation</li>
<li>Compatibility with major civilian mobile wireless standards</li>
</ul>
<p>In addition, military radio designers have their own set of requirements, which are driven not only by performance demands, but also by economic realities: improving Time-to-Market (TTM), reducing development costs, and minimizing project risk.</p>
<p><b>Selecting a small-cell SoC for defense applications</b></p>
<p>Without going into a complete shopping list of detailed specifications, the following list highlights key parameters that designers should focus on when evaluating SoCs for use in military broadband wireless applications:</p>
<ul>
<li><b>Size and Power</b>. Military OEMs need to focus on solutions that offer the smallest footprint and the lowest chip and system power consumption. For instance, leading SoCs consume 5W or less for the chip alone. Radio modules (minus the power amplifier) are in the range of 5W to 10W. In terms of size, the world’s smallest small cell is the size of a smartphone!</li>
<li><b>Long Range</b>. Typical small-cell SoCs designed for commercial deployments support cell sizes on the order of 1 Km. Some of the more recent SoCs can now also target the military market because they support cell sizes up to 20 Km.</li>
<li><b>Frequency Agility and Cognitive Radio capability</b>. The combination of frequency agility and cognitive radio is truly powerful, but full-time cognitive radio operation requires a dedicated RF listen port in addition to the standard TX/RX ports. The RF listen capability can also be leveraged to optimize On-the-Move operation, adapting to the continuously changing RF environment faced by vehicle-mounted base stations.</li>
<li><b>Multi-standard support</b>. Military broadband wireless SoCs should support standard mobile waveforms from 2G to 4G LTE for compatibility with civilian infrastructure, as well as customized versions of civilian waveforms, and fully custom tactical waveforms. An ideal SDR platform for military base stations should also include an integrated development environment to easily implement airlink customizations and military-specific applications (such as cognitive radio algorithms).</li>
</ul>
<p><b>Device and application match-up</b></p>
<p>Bringing advanced wireless broadband services to the harsh environment of the battlefield has proven to be a tough job. The massive global effort made to develop advanced base station SoCs for the commercial mobile wireless market offers military radio designers the chance to strike a new balance between extreme performance, cost-effectiveness, and low risk.  These latest SoCs and small cell platforms are sending shockwaves through the military communications market, offering new possibilities for innovation.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.octasic.com/blog/2013/05/09/small-cell-socs-may-be-just-what-military-broadband-systems-are-looking-for/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>WebRTC’s Hidden Audio Gem, Opus</title>
		<link>http://www.octasic.com/blog/2013/03/08/webrtcs-hidden-audio-gem-opus/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=webrtcs-hidden-audio-gem-opus</link>
		<comments>http://www.octasic.com/blog/2013/03/08/webrtcs-hidden-audio-gem-opus/#comments</comments>
		<pubDate>Fri, 08 Mar 2013 20:18:23 +0000</pubDate>
		<dc:creator>James Awad</dc:creator>
				<category><![CDATA[Audio]]></category>
		<category><![CDATA[Media Processing]]></category>
		<category><![CDATA[codec]]></category>
		<category><![CDATA[opus codec]]></category>
		<category><![CDATA[WebRTC]]></category>

		<guid isPermaLink="false">http://www.octasic.com/blog/?p=442</guid>
		<description><![CDATA[Although most descriptions of WebRTC tout its rich media sharing capabilities, most discussion centers on video and data sharing. As it happens, WebRTC’s audio codec, Opus (at one time called “Harmony”), is among the most remarkable of its features. A free audio codec in development since 2007 by Octasic, Mozilla and the Xiph Foundation in collaboration with Google, Microsoft’s Skype unit and Broadcom, Opus was recently ratified in 2012 by the IETF as RFC 6716. Moreover, after considering ten codecs, the IETF also reached “strong consensus” to adopt Opus as a mandatory-to-implement (MTI) codec for WebRTC.

Update: Since then there has been a push to adopt more legacy codecs to facilitate interworking. See http://tools.ietf.org/html/draft-marjou-rtcweb-audio-codecs-for-interop-00

In any case, to handle the demands of WebRTC, Opus is not your run-of-the-mill codec locked into a specific bitrate and narrowly optimized for speech, high quality music or video. No, Opus can handle everything from low bitrate voice to high bitrate music coding thanks to the fact that it is an amalgam of Skype's SILK codec that is optimized for low rate voice coding, and Xiph.org's Constrained Energy Lapped Transform CELT codec, a very low delay, low CPU/memory requirement successor to Vorbis that handles higher bitrates and thus higher quality audio.]]></description>
				<content:encoded><![CDATA[<p>Although most descriptions of WebRTC tout its rich media sharing capabilities, most discussion centers on video and data sharing. As it happens, WebRTC’s audio codec, Opus (at one time called “Harmony”), is among the most remarkable of its features. A free audio codec in development since 2007 by Octasic, Mozilla and the Xiph Foundation in collaboration with Google, Microsoft’s Skype unit and Broadcom, Opus was recently ratified in 2012 by the IETF as RFC 6716. Moreover, after considering ten codecs, the IETF also reached “strong consensus” to adopt Opus as a mandatory-to-implement (MTI) codec for WebRTC.</p>
<p><i>Update: Since then there has been a push to adopt more legacy codecs to facilitate interworking. See </i><a href="http://tools.ietf.org/html/draft-marjou-rtcweb-audio-codecs-for-interop-00"><i>http://tools.ietf.org/html/draft-marjou-rtcweb-audio-codecs-for-interop-00</i></a><i> </i></p>
<p>In any case, to handle the demands of WebRTC, Opus is not your run-of-the-mill codec locked into a specific bitrate and narrowly optimized for speech, high quality music or video. No, Opus can handle everything from low bitrate voice to high bitrate music coding thanks to the fact that it is an amalgam of Skype&#8217;s SILK codec that is optimized for low rate voice coding, and Xiph.org&#8217;s Constrained Energy Lapped Transform CELT codec, a very low delay, low CPU/memory requirement successor to Vorbis that handles higher bitrates and thus higher quality audio.</p>
<p>Opus also has a hybrid mode wherein SILK encodes the low frequency band and the high frequencies are taken care of by CELT. This explains the chimera-like nature of Opus and how it can effortlessly tackle everything from low bitrate voice coding, Internet radio, adaptive streaming, game sound effects, storage applications, and up to and including high-rate music coding. It handles bitrates from 6 kb/sec to 510 kb/sec, sampling rates from 8 kHz to 48 kHz, frame sizes from 2.5 milliseconds to 60 milliseconds, and it supports both Constant bit-rate (CBR) and variable bit-rate (VBR).</p>
<p>All of this flexibility is why Opus co-creator Jean-Marc Valin (now at Mozilla) calls it &#8220;<i>the Swiss army knife of audio codecs.</i>&#8221;</p>
<p>Or, as Xiph.org founder Monty Montgomery has remarked, “<i>It is The One Codec.</i>”</p>
<p>For more details, see the full specification in <a href="http://tools.ietf.org/html/rfc6716">RFC 6716</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.octasic.com/blog/2013/03/08/webrtcs-hidden-audio-gem-opus/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>WebRTC for dummies</title>
		<link>http://www.octasic.com/blog/2013/01/11/webrtc-for-dummies/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=webrtc-for-dummies</link>
		<comments>http://www.octasic.com/blog/2013/01/11/webrtc-for-dummies/#comments</comments>
		<pubDate>Fri, 11 Jan 2013 20:15:06 +0000</pubDate>
		<dc:creator>James Awad</dc:creator>
				<category><![CDATA[Media Processing]]></category>
		<category><![CDATA[WebRTC]]></category>

		<guid isPermaLink="false">http://www.octasic.com/blog/?p=433</guid>
		<description><![CDATA[When Ericsson officials at the recent AT&#038;T Developer Summit in Las Vegas said that that more innovation is happening in communication today than has happened in the previous 30 years, one of the items they probably had in mind was WebRTC, since their new Web Real Time Communications (WebRTC) development platform enables any device to be reached by an existing mobile number.

Indeed, in the coming weeks you’ll be seeing many WebRTC-related blogs in this space. Why? As Phil Edholm remarked at nojitter.com, “Potentially, WebRTC and HTML5 could enable the same transformation for real-time communications that the original browser did for information.”]]></description>
				<content:encoded><![CDATA[<p>When Ericsson officials at the recent AT&amp;T Developer Summit in Las Vegas said that that more innovation is happening in communication today than has happened in the previous 30 years, one of the items they probably had in mind was WebRTC, since their new Web Real Time Communications (WebRTC) development platform enables any device to be reached by an existing mobile number.</p>
<p>Indeed, in the coming weeks you’ll be seeing many WebRTC-related blogs in this space. Why? As Phil Edholm remarked at nojitter.com, “<i>Potentially, WebRTC and HTML5 could enable the same transformation for real-time communications that the original browser did for information.</i>”</p>
<p>Ostensibly, WebRTC (Web Real-Time Communication) is a media engine and set of APIs (free and openly available at webrtc.org) that bestows upon web browsers Real-Time Communications (RTC) capabilities—particularly low-cost, high-quality, voice and video services and P2P file sharing. This is all done via simple Javascript APIs and HTML5, <i>not</i> by downloading, installing and updating plugins of any sort. RTC is already utilized by web services such as Skype, Facebook and Google Hangouts, but they require downloads, native apps or plugins. Plugins are prone to all sorts of problems, such as those relating to security (since the application no longer runs in the sandboxed browser environment), maintenance, troubleshooting and accessing the full range of native browser resources due to possible privilege restrictions. Plus they can be a pain to install, so many people simply don’t.</p>
<p>With the upcoming generation of WebRTC-enabled browsers, however, a web services app can easily instruct the browser to make a native real-time voice, video or data connection to another WebRTC device or to a WebRTC media server using RTP.</p>
<p>But the ramifications of WebRTC go far beyond this simple description, since by incorporating such simple, standardized real-time communication capabilities into the fabric of the web, a new open development platform and philosophy emerges enabling any application and any device to communicate with each other. This is demonstrated by the debut of Plivo’s new SDK that gives developers the tools to connect WebRTC apps to standard SIP lines, which means that you can now call people, businesses or call centers that rely on regular phone lines from your browser over WebRTC, and <i>vice versa</i>.</p>
<h5><b>Origins</b></h5>
<p>Google’s Gmail video chat functionality caught our fancy in 2008, and in 2011 Google introduced Hangouts which scaled up real-time video chat technology based on the Google Talk service, along with Gmail. Google then got the WebRTC ball rolling when in 2010 it acquired Global IP Solutions, or GIPS (formerly Global IP Sound) which devised RTC codecs and echo cancellation technology. Google then spawned an open source version of WebRTC and began to incorporate it into Chrome (the folks at Mozilla/Firefox and Opera followed suit). Google then approached the World Wide Web Consortium (W3C) and Internet Engineering Task Force (IETF) to guarantee industry acceptance.</p>
<p>Now, whereas the WebRTC APIs used by a web application are being drafted by the World Wide Web Consortium (W3C), the RTCWEB group at the IETF is defining the underlying formats and protocol set used by the web apps for inter-browser communication.</p>
<h5><b>Under the Hood</b></h5>
<p>WebRTC functionality falls into three categories:</p>
<p>The first, <b>Media Streams</b> concern granting web apps and sites access to the user PC’s camera and mic, via the getUserMedia API. Many interesting apps can be devised with this API alone (such as Chris Wilson’s <a href="http://webaudiodemos.appspot.com/" target="_blank">examples</a> for web audio).</p>
<p>Second, <b>PeerConnection</b> is an API to the engine that enables direct, high-quality, peer -to-peer connections between two web browsers for audio and video calls.</p>
<p>Third, the <b>DataChannels </b>API enables any web app to send data in a peer-to-peer fashion.</p>
<p>The Mozilla folks have been particularly interested in the DataChannels aspect of WebRTC, and they immediately began work on building it into Firefox. They believe it will transform social applications beyond simple audio and video conferencing, since while teleconferencing you could now in theory easily share photos, videos or links to interesting web content by dragging them into a video chat window.</p>
<p>The first browsers with some WebRTC-capability, Chrome and Mozilla, appeared in 2012. In fact, Google, Mozilla, and Opera have been key sponsors of WebRTC and their first browsers with some WebRTC capabilities appeared toward the end of 2012; AT&amp;T, Cisco Systems, and Plantronics are major boosters of it too. Interestingly, Plantronics CTO Joe Burton (who delivered a keynote address at the world’s first WebRTC conference) believes that WebRTC and HTML5 are so intertwined that WebRTC protocols should be incorporated into the HTML5 spec itself. (But don’t expect that anytime soon.)</p>
<h5><b>WebRTC will Revolutionize All of Communications</b></h5>
<p>With the rise of WebRTC, real-time communications and data sharing RTC will now become ubiquitous. With true open standards for real-time communication for video, audio and data, innovation in the multi-trillion dollar communications industry will move at warp speed. Customers and service reps will soon see and talk to each other via seamless person-to-person communication, as will opponents in online games. Moreover, WebRTC support for streaming video and audio from native devices such as webcams allows for such telepresence-like niceties as headtracking—a miraculous increase in capabilities without the need for downloads, installation, tweaking, or what-not. Just surf to the proper address and the simple, standardized web will APIs kick right in.</p>
<p>As WebRTC takes hold across computers and all sorts of devices, we have the real ability to create the next generation phone network where every WebRTC-enabled device can communicate with amazing audio and video quality.</p>
<p>Perhaps it really is the most exciting time in the history of the telecom industry, with WebRTC leading the charge, driving innovation at an unheard-of pace.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.octasic.com/blog/2013/01/11/webrtc-for-dummies/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>VoIP and Echo Cancellation</title>
		<link>http://www.octasic.com/blog/2012/04/05/voip-and-echo-cancellation/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=voip-and-echo-cancellation</link>
		<comments>http://www.octasic.com/blog/2012/04/05/voip-and-echo-cancellation/#comments</comments>
		<pubDate>Thu, 05 Apr 2012 17:52:37 +0000</pubDate>
		<dc:creator>Guillaume Renaud</dc:creator>
				<category><![CDATA[Audio]]></category>
		<category><![CDATA[Media Processing]]></category>
		<category><![CDATA[echo cancellation]]></category>
		<category><![CDATA[voip]]></category>

		<guid isPermaLink="false">http://www.octasic.com/blog/?p=414</guid>
		<description><![CDATA[<p><a href="http://www.octasic.com/blog/wp-content/uploads/2012/04/echo_cancel_1.png"><img class="alignright size-thumbnail wp-image-417" title="echo generation in voip" src="http://www.octasic.com/blog/wp-content/uploads/2012/04/echo_cancel_1-680x300.png" alt="" width="680" height="300" /></a>Ever since Voice over Internet Protocol (VoIP) was introduced, all calls going to the Public Switched Telephone Network (PSTN) have been plagued with line or electrical echo. I often get asked where these electrical echoes come from and why it is inevitable when using VoIP. We’ve given many seminars on the subject, especially in the Asterisk space where people are suddenly confronted with this problem when building equipment. I’ve put together a basic introduction explaining why echo cancellation is needed when using VoIP over PSTN.</p>
<p>Electrical or line echo is inherent to the PSTN, it has always been present and cannot be avoided because of the nature of the lines connected to homes and offices. These local or subscriber loop circuits use two wires to carry the voice signals while the voice channels use four wires for bi-directional communications beyond the first switch that is the local exchange. The conversion between the four wire and two wire electrical circuits is done by transformer called a telephone hybrid whose goal is to separate the signals’ directions and adapt the impedance of both circuits. Like all analog circuits, the hybrids can’t perfectly match the impedances and that causes part of the signal &#8230;</p>]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.octasic.com/blog/wp-content/uploads/2012/04/echo_cancel_1.png"><img class="alignright size-thumbnail wp-image-417" title="echo generation in voip" src="http://www.octasic.com/blog/wp-content/uploads/2012/04/echo_cancel_1-680x300.png" alt="" width="680" height="300" /></a>Ever since Voice over Internet Protocol (VoIP) was introduced, all calls going to the Public Switched Telephone Network (PSTN) have been plagued with line or electrical echo. I often get asked where these electrical echoes come from and why it is inevitable when using VoIP. We’ve given many seminars on the subject, especially in the Asterisk space where people are suddenly confronted with this problem when building equipment. I’ve put together a basic introduction explaining why echo cancellation is needed when using VoIP over PSTN.</p>
<p>Electrical or line echo is inherent to the PSTN, it has always been present and cannot be avoided because of the nature of the lines connected to homes and offices. These local or subscriber loop circuits use two wires to carry the voice signals while the voice channels use four wires for bi-directional communications beyond the first switch that is the local exchange. The conversion between the four wire and two wire electrical circuits is done by transformer called a telephone hybrid whose goal is to separate the signals’ directions and adapt the impedance of both circuits. Like all analog circuits, the hybrids can’t perfectly match the impedances and that causes part of the signal to be electrically reflected back; this is the source of electrical echo.</p>
<p>You might be asking yourself why this wasn’t a problem before VoIP was introduced, which brings us to the second characteristic of echo: delay. Echo can only be perceived by the human ear when it as a delay of 10ms or more but it only becomes disturbing when the delay exceeds 30ms and unacceptable at 50ms or more. In the days when telephony was all circuit switched, the delay was only caused by the length of electrical wires in the circuit connecting the two endpoints. Echo cancellers were only needed on long distance calls because only they involved long enough circuits to make the echo disturbing.</p>
<p>In a VoIP scenario we are not talking about circuit switching anymore since the core of the network is packet switched. Delays are guaranteed to exceed the 30ms threshold because of the additional time it takes to encode, packetize, route, decode and de-packetize the voice stream. This is why even local calls require a solution to the line echo problem.</p>
<p><a href="http://www.octasic.com/blog/wp-content/uploads/2012/04/echo_cancel_2.png"><img class="alignright size-thumbnail wp-image-419" title="packet delay and echo in voip" src="http://www.octasic.com/blog/wp-content/uploads/2012/04/echo_cancel_2-680x300.png" alt="" width="680" height="300" /></a></p>
<p>There have been a lot of attempts made to solve this problem by either tuning the gains to minimize the effect of the echo or by implementing echo cancellation in software. Some of these software echo cancellers are good enough to resolve the issue but will impact the CPU usage of the system, lowering the number of channels that it can handle or raising the minimum hardware requirements. Today most interface card vendors include hardware echo cancellation in their latest products, to resolve the echo problem without taxing the host CPU.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.octasic.com/blog/2012/04/05/voip-and-echo-cancellation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>An introduction to Internet Video Delivery</title>
		<link>http://www.octasic.com/blog/2011/11/03/an-introduction-to-internet-video-delivery/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=an-introduction-to-internet-video-delivery</link>
		<comments>http://www.octasic.com/blog/2011/11/03/an-introduction-to-internet-video-delivery/#comments</comments>
		<pubDate>Thu, 03 Nov 2011 19:00:00 +0000</pubDate>
		<dc:creator>Hui Pang</dc:creator>
				<category><![CDATA[Media Processing]]></category>
		<category><![CDATA[Video]]></category>

		<guid isPermaLink="false">http://www.octasic.com/blog/?p=386</guid>
		<description><![CDATA[<p><a href="www.octasic.com/blog/2011/11/03/an-introduction-to-internet-video-delivery/"><img class="alignright size-full wp-image-398" title="adaptive streaming diagram" src="http://www.octasic.com/blog/wp-content/uploads/2011/10/pp_adaptivestreaming.png" alt="" width="680" height="374" /></a>As video becomes one of the most demanding services for network traffic and Internet video becomes an important part of modern life, the quality of experience needs to meet the user’s expectations, regardless of the device or network being used. More and more video is being consumed on smartphones and tablets. The smaller screens allow for lower bit-rates, but the video playback need to start quickly and remain smooth throughout.</p>
<p>Internet Video delivery is challenging because of factors such as high bitrates and sensitivity to delay or packet loss.</p>
<p><strong>Video Streaming</strong></p>
<p>In the past, video streaming was typically associated to RTSP, with RTP used for transmission.  This protocol uses “VCR-like” commands such as PLAY and PAUSE. In this scheme, the server has to keep track of the client’s state. The server starts playing a stream when giving the PLAY command, and has to maintain the state of each session in order to know what packet to send next. The video stream is based on a single “track”, a file with a fixed encoding profile that cannot change. The quality would quickly suffer when there was a shortage of bandwidth such as congestion leading to packet loss.</p>
<p>Today, HTTP based adaptive &#8230;</p>]]></description>
				<content:encoded><![CDATA[<p><a href="www.octasic.com/blog/2011/11/03/an-introduction-to-internet-video-delivery/"><img class="alignright size-full wp-image-398" title="adaptive streaming diagram" src="http://www.octasic.com/blog/wp-content/uploads/2011/10/pp_adaptivestreaming.png" alt="" width="680" height="374" /></a>As video becomes one of the most demanding services for network traffic and Internet video becomes an important part of modern life, the quality of experience needs to meet the user’s expectations, regardless of the device or network being used. More and more video is being consumed on smartphones and tablets. The smaller screens allow for lower bit-rates, but the video playback need to start quickly and remain smooth throughout.</p>
<p>Internet Video delivery is challenging because of factors such as high bitrates and sensitivity to delay or packet loss.</p>
<p><strong>Video Streaming</strong></p>
<p>In the past, video streaming was typically associated to RTSP, with RTP used for transmission.  This protocol uses “VCR-like” commands such as PLAY and PAUSE. In this scheme, the server has to keep track of the client’s state. The server starts playing a stream when giving the PLAY command, and has to maintain the state of each session in order to know what packet to send next. The video stream is based on a single “track”, a file with a fixed encoding profile that cannot change. The quality would quickly suffer when there was a shortage of bandwidth such as congestion leading to packet loss.</p>
<p>Today, HTTP based adaptive streaming services provide a more efficient means of delivery over the Internet.  Similarly to RTSP, the video stream can start playing out before the entire file has been downloaded. The main advantage of adaptive streaming however is that it allows the end point to adjust to changes in available bandwidth and delay.</p>
<p>Apple’s HTTP Live Streaming, SilverLight Smooth Streaming from Microsoft and Adobe’s HTTP Dynamic Streaming are notable examples of this technology. These applications manage a set of tracks, each with a different bit rate. The tracks are then divided into small segments (typically 2 seconds) for distribution to a wide variety of different display profiles.</p>
<p>The main innovation here is that if an endpoint sees a dynamic change in bandwidth (up or down) it can react by changing “tracks” to a higher- or lower-bandwidth track, offering the optimal quality. With a limited amount of local buffering the end point can guarantee an acceptable user experience. The popularity of commercial adaptive streaming applications proves that the quality is critical for Internet video.</p>
<p><a href="http://www.octasic.com/blog/wp-content/uploads/2011/11/ps_adaptivestreaming.png"><img class="alignright size-large wp-image-405" title="throughput adaptive streaming" src="http://www.octasic.com/blog/wp-content/uploads/2011/11/ps_adaptivestreaming-1024x327.png" alt="" width="707" height="225" /></a></p>
<p>Unlike traditional streaming protocols, HTTP based progressive download offers “smooth”  video transmission. These streaming techniques are very effective for stored premium content. In these scenarios, the content is encoded into multiple tracks “off-line” and then placed on the server for consumption. It is less useful for user-generated content, were the number of video clips is orders of magnitudes larger (think of YouTube for example). In those situations, it makes more sense to adapt the bandwidth on the fly for the requested clip (see below).</p>
<p><strong>Video delivery Network</strong></p>
<p>We’ve discussed video delivery from the point of view of the client and the server. The network infrastructure in between is another key factor in efficient delivery. Reducing the data sent by a central server to many clients will significantly reduce the possibility of congestion and packet loss. Multicasting, distributing and caching the data to edge nodes is an efficient way to reduce the original data throughput requirement from the central server. The edge node may also be responsible for dynamically adapting the content to the different profiles required by various terminal devices.</p>
<p>The Content Delivery Network (CDN) model is often used for Internet video delivery. In a CDN, the video source or stream originates from a file server or broadcast TV headend. Popular video streams are stored (cached) at many or all of the CDN edge nodes. When a client requests the stream, it will be transparently redirected to the &#8220;closest&#8221; edge node. CDN architectures and optimization are quite an elaborate topic that we’ll save for another post.</p>
<p>CDNs help alleviate congestion in the core of a network like the public internet. However with the popularity of mobile video, the problem is slightly different. We not only need to reduce congestion in the core network, we now need to reduce congestion on the wireless link as well. To solve this problem we’re seeing edge routers become more sophisticated. Edge routers aimed at wireless deployments now have embedded DSP capabilities to do inline transrating: they can dynamically change the video bitrate to reduce it when network congestion occurs. These routers perform deep packet inspection (DPI) to determine that the contents are video, and then redirect the media to a DSP for dynamic transcoding/transrating.</p>
<p>As video continues to consume a larger and larger percentage of mobile data traffic, the combination of adaptive streaming, CDNs and inline transrating will become commonplace. In future posts we’ll go into more detail of each of these technologies.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.octasic.com/blog/2011/11/03/an-introduction-to-internet-video-delivery/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>HTML5 Video &#8211; Microsoft weighs in</title>
		<link>http://www.octasic.com/blog/2011/09/28/html5-video-microsoft-weighs-in/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=html5-video-microsoft-weighs-in</link>
		<comments>http://www.octasic.com/blog/2011/09/28/html5-video-microsoft-weighs-in/#comments</comments>
		<pubDate>Wed, 28 Sep 2011 19:21:39 +0000</pubDate>
		<dc:creator>James Awad</dc:creator>
				<category><![CDATA[Opinion]]></category>

		<guid isPermaLink="false">http://www.octasic.com/blog/?p=275</guid>
		<description><![CDATA[<p><img class="alignright size-full wp-image-298" title="Treehouse photo by Linus Bohman (http://www.flickr.com/photos/bohman/)" src="http://www.octasic.com/blog/wp-content/uploads/2011/09/pp_flashandmicrosoft.png" alt="" width="280" height="350" /></p>
<p>A lot of noise and fuss has come out of the HTML5 video debate. There are in fact (at least) two debates going on. The first is Flash vs. HTML5. Here, it’s an old-world vs. new-world debate. The PC-era was good to Flash, but as mobile devices take on more clout, things like battery life, security, and touch-interfaces are used as arguments to kill flash and move to something new. The second debate is from within the HTML5 camp itself, where the choice of video codec has become a prickly subject. That subject should be left for another post.</p>
<p>The big news last week was that Microsoft announced how their forthcoming Internet Explorer 10 (IE10) will NOT support plug-ins, including Flash. Actually, this isn&#8217;t entirely true. The &#8220;Metro-style&#8221; version of IE10 will not support these plug-ins. There will also be a desktop version, but it&#8217;s not where Microsoft is headed. Desktop applications are seen as legacy support. <a href="http://www.zdnet.com/blog/microsoft/microsoft-to-developers-metro-is-your-future/10611" target="_blank">This blog</a> describes the differences quite well. This was obviously a bold decision, but anchored in the same reasoning as what I mentioned above. IE10 will be part of Windows 8 which is promoting touch interfaces, and better integration with Windows Phone 7.&#8230;</p>]]></description>
				<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-298" title="Treehouse photo by Linus Bohman (http://www.flickr.com/photos/bohman/)" src="http://www.octasic.com/blog/wp-content/uploads/2011/09/pp_flashandmicrosoft.png" alt="" width="280" height="350" /></p>
<p>A lot of noise and fuss has come out of the HTML5 video debate. There are in fact (at least) two debates going on. The first is Flash vs. HTML5. Here, it’s an old-world vs. new-world debate. The PC-era was good to Flash, but as mobile devices take on more clout, things like battery life, security, and touch-interfaces are used as arguments to kill flash and move to something new. The second debate is from within the HTML5 camp itself, where the choice of video codec has become a prickly subject. That subject should be left for another post.</p>
<p>The big news last week was that Microsoft announced how their forthcoming Internet Explorer 10 (IE10) will NOT support plug-ins, including Flash. Actually, this isn&#8217;t entirely true. The &#8220;Metro-style&#8221; version of IE10 will not support these plug-ins. There will also be a desktop version, but it&#8217;s not where Microsoft is headed. Desktop applications are seen as legacy support. <a href="http://www.zdnet.com/blog/microsoft/microsoft-to-developers-metro-is-your-future/10611" target="_blank">This blog</a> describes the differences quite well. This was obviously a bold decision, but anchored in the same reasoning as what I mentioned above. IE10 will be part of Windows 8 which is promoting touch interfaces, and better integration with Windows Phone 7.</p>
<p>We saw the first inklings of trouble for Flash when <a href="http://www.apple.com/hotnews/thoughts-on-flash/" target="_blank">Steve Jobs announced</a> that iOS devices would offer no support for Flash or Flash video. They took a lot of heat for it, and we eventually saw some Android phones touting Flash support as an important differentiating feature! It’s obvious today that Apple will not change course, and with Microsoft following suit, a trend is forming.</p>
<p>Slowly but surely we’ve seen people taking sides in this debate. Content owners and video portals were coming down on either side of the fence. Hulu for example has gone on record saying that HTML5 does not <a href="http://blog.hulu.com/2010/05/13/pardon-our-dust/" target="_blank">“meet all of our customers’ needs”</a> (See &#8220;An Aside on HTML5&#8243;). YouTube launched an HTML5 “experiment” but warns of many shortcomings, such as limitations with full screen viewing and poor support for ads. I’m sure YouTube’s viewers are ok with ad-free videos, but Google’s management won’t get behind HTML5 if it kills off their revenue streams!</p>
<p>So the race is on between Adobe and the HTML5 world. Who can develop tools that developers love? Adobe hasn’t given up the fight. They’re now rolling out a final version of Adobe Flash Player 11, which will run across all popular desktop and mobile OSes including iOS (using Adobe AIR 3). In my opinion Adobe will carve itself out a nice niche, but eventually, as HTML5/Javascript tools become more complete, the overwhelming number of web-based sites and developers will prevail. We’ll see competitive tools come out that allow the creation of sophisticated games, media players and applications inside the browser itself.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.octasic.com/blog/2011/09/28/html5-video-microsoft-weighs-in/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>À La Carte Backhaul</title>
		<link>http://www.octasic.com/blog/2011/09/21/a-la-carte-backhaul/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=a-la-carte-backhaul</link>
		<comments>http://www.octasic.com/blog/2011/09/21/a-la-carte-backhaul/#comments</comments>
		<pubDate>Wed, 21 Sep 2011 17:09:24 +0000</pubDate>
		<dc:creator>Emmanuel Gresset</dc:creator>
				<category><![CDATA[Wireless]]></category>
		<category><![CDATA[backhaul]]></category>
		<category><![CDATA[basestation]]></category>
		<category><![CDATA[bts]]></category>
		<category><![CDATA[rural]]></category>
		<category><![CDATA[wireless]]></category>

		<guid isPermaLink="false">http://www.octasic.com/blog/?p=261</guid>
		<description><![CDATA[<p><img class="alignleft size-full wp-image-263" title="pp_ruralbackhaul" src="http://www.octasic.com/blog/wp-content/uploads/2011/09/pp_ruralbackhaul.png" alt="" width="680" height="374" />As a follow up to my article on <a href="http://e-ditionsbyfry.com/Olive/ODE/WDD/Default.aspx?href=WDD%2F2011%2F06%2F01&#38;pageno=10&#38;view=document">Wireless Expansion in Developing Areas</a> I wrote not long ago for Wireless Design and Development magazine, I asked myself if I could go one step further and explain how we made it possible to provide backhaul. More specifically, how our client can easily leverage the features we added to the latest DSP to provide cellular and backhaul concurrently using a single multi-core device. All of this while keeping everything at low cost, low power and not affecting the ease of installation.</p>
<p>In my <a href="http://e-ditionsbyfry.com/Olive/ODE/WDD/Default.aspx?href=WDD%2F2011%2F06%2F01&#38;pageno=10&#38;view=document">previous article</a>, we were using a System on Chip (SoC) DSP that would be the core processor of a compact BTS setup. The very same SoC can be used to provide the backhaul simultaneously to the cellular portion. With its multiple independent RF interfaces, our clients can connect both cellular and backhaul antenna directly on one device. The backhaul is also flexible, offering the option to use OFDM or cellular UE standard with high performance radio. Since we wanted that our clients be able to customize and adapt the backhaul modem to their specific needs (LOS, NLOS), we designed our DSP to be C programmable. That’s a lot &#8230;</p>]]></description>
				<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-263" title="pp_ruralbackhaul" src="http://www.octasic.com/blog/wp-content/uploads/2011/09/pp_ruralbackhaul.png" alt="" width="680" height="374" />As a follow up to my article on <a href="http://e-ditionsbyfry.com/Olive/ODE/WDD/Default.aspx?href=WDD%2F2011%2F06%2F01&amp;pageno=10&amp;view=document">Wireless Expansion in Developing Areas</a> I wrote not long ago for Wireless Design and Development magazine, I asked myself if I could go one step further and explain how we made it possible to provide backhaul. More specifically, how our client can easily leverage the features we added to the latest DSP to provide cellular and backhaul concurrently using a single multi-core device. All of this while keeping everything at low cost, low power and not affecting the ease of installation.</p>
<p>In my <a href="http://e-ditionsbyfry.com/Olive/ODE/WDD/Default.aspx?href=WDD%2F2011%2F06%2F01&amp;pageno=10&amp;view=document">previous article</a>, we were using a System on Chip (SoC) DSP that would be the core processor of a compact BTS setup. The very same SoC can be used to provide the backhaul simultaneously to the cellular portion. With its multiple independent RF interfaces, our clients can connect both cellular and backhaul antenna directly on one device. The backhaul is also flexible, offering the option to use OFDM or cellular UE standard with high performance radio. Since we wanted that our clients be able to customize and adapt the backhaul modem to their specific needs (LOS, NLOS), we designed our DSP to be C programmable. That’s a lot of option for backhaul, all this while still having a cellular cell simultaneously covering around 20km. This design flexibility allows multiple backhaul options within a large cell SoC processor.</p>
<p>We want our clients to have a lot of flexibility to create their low cost, low power and easy to install BTS solution. In that mindset, we added a PCIe and USB interface to our SoC. This opens many interesting and original ways to take advantage the DSP. For example, using the PCIe or USB to connect a low cost Wi-FI module, we can provide outdoor Wi-FI for short range internet service. All that would be needed would be to connect a Wi-Fi module that will interact directly with the processor.</p>
<p>With three RF interfaces, inputs like for PCIe/USB and flexible backhaul. We created a Small Cell Processor  that lets System Designers run their imagination wild. I am looking forward to seeing the great things our clients will do with our device.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.octasic.com/blog/2011/09/21/a-la-carte-backhaul/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Roadtrips in 2015</title>
		<link>http://www.octasic.com/blog/2011/08/19/roadtrips-in-2015/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=roadtrips-in-2015</link>
		<comments>http://www.octasic.com/blog/2011/08/19/roadtrips-in-2015/#comments</comments>
		<pubDate>Fri, 19 Aug 2011 21:25:20 +0000</pubDate>
		<dc:creator>James Awad</dc:creator>
				<category><![CDATA[Opinion]]></category>
		<category><![CDATA[4G]]></category>
		<category><![CDATA[ipad]]></category>
		<category><![CDATA[lte]]></category>
		<category><![CDATA[video]]></category>
		<category><![CDATA[video conferencing]]></category>

		<guid isPermaLink="false">http://www.octasic.com/blog/?p=165</guid>
		<description><![CDATA[<p><img class="alignleft size-full wp-image-229" title="pp_roadtrip2015" src="http://www.octasic.com/blog/wp-content/uploads/2011/08/pp_roadtrip2015.png" alt="" width="408" height="180" />Summer time is the perfect chance for people to hit the road with their families and see the sights. Roadtrips have been a North-American tradition for decades. They always sound like a good idea when you head out, but by the third day of “Are we there yet?” the parents usually want to strangle someone. There was a clear trend this year however: instead of seeing 100$ DVD players strapped in front of children’s faces, I saw lots of iPads in their place. I was also guilty of this. Playing video on a tablet is just so much easier than shuffling through DVD discs, the custom battery charger, carrying an extra gadget. And most importantly, the screen is a lot bigger. But this isn’t an ad for Apple. The same would have been true for any tablet. In fact, less expensive tablets will probably start showing up more frequently, attached to headrests in cars everywhere in lieu of cheap DVD players.</p>
<p>Fast-forward a few years, and what do you think we’ll see? Those same tablets will have pervasive 3G/4G connections. And those same kids will grow up a little. Within 1-2 years, I’m convinced we’ll see the proliferation of “backseat &#8230;</p>]]></description>
				<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-229" title="pp_roadtrip2015" src="http://www.octasic.com/blog/wp-content/uploads/2011/08/pp_roadtrip2015.png" alt="" width="408" height="180" />Summer time is the perfect chance for people to hit the road with their families and see the sights. Roadtrips have been a North-American tradition for decades. They always sound like a good idea when you head out, but by the third day of “Are we there yet?” the parents usually want to strangle someone. There was a clear trend this year however: instead of seeing 100$ DVD players strapped in front of children’s faces, I saw lots of iPads in their place. I was also guilty of this. Playing video on a tablet is just so much easier than shuffling through DVD discs, the custom battery charger, carrying an extra gadget. And most importantly, the screen is a lot bigger. But this isn’t an ad for Apple. The same would have been true for any tablet. In fact, less expensive tablets will probably start showing up more frequently, attached to headrests in cars everywhere in lieu of cheap DVD players.</p>
<p>Fast-forward a few years, and what do you think we’ll see? Those same tablets will have pervasive 3G/4G connections. And those same kids will grow up a little. Within 1-2 years, I’m convinced we’ll see the proliferation of “backseat video chatting”. Does that sound crazy? It’ll be free, thanks to applications like Skype that just launched their full-screen video iPad app last week. Once these apps are well integrated to social media and presence, you’ll always know who you can “hang-out” with, as Google+ calls it. In fact, this may create the problem that Google+ is trying to solve with their “Hang-outs”. As far as I know, people aren’t just sitting at their computer, thinking, “I wish I could video chat my friends”. But when you’re stuck in the car for 8 hours, you’ll talk to anyone!</p>
<p>We’ll check back in a few years to see if this comes true. In the meanwhile, we’ll stick to desktop video conferencing.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.octasic.com/blog/2011/08/19/roadtrips-in-2015/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Applied Psychoacoustics: Or How to Mutilate an Audio Signal and Get Away With It</title>
		<link>http://www.octasic.com/blog/2011/06/15/applied-psychoacoustics-or-how-to-mutilate-an-audio-signal-and-get-away-with-it/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=applied-psychoacoustics-or-how-to-mutilate-an-audio-signal-and-get-away-with-it</link>
		<comments>http://www.octasic.com/blog/2011/06/15/applied-psychoacoustics-or-how-to-mutilate-an-audio-signal-and-get-away-with-it/#comments</comments>
		<pubDate>Wed, 15 Jun 2011 20:16:42 +0000</pubDate>
		<dc:creator>Jean-Marc Valin</dc:creator>
				<category><![CDATA[Audio]]></category>
		<category><![CDATA[Media Processing]]></category>
		<category><![CDATA[audio]]></category>
		<category><![CDATA[audio codec]]></category>
		<category><![CDATA[G.711]]></category>
		<category><![CDATA[opus codec]]></category>
		<category><![CDATA[psychoacoustics]]></category>
		<category><![CDATA[speech]]></category>

		<guid isPermaLink="false">http://www.octasic.com/blog/?p=73</guid>
		<description><![CDATA[<h3><img class="alignleft size-full wp-image-232" title="pp_psychoacoustics" src="http://www.octasic.com/blog/wp-content/uploads/2011/06/pp_psychoacoustics.png" alt="" width="408" height="180" />Psychoacoustics</h3>
<p>As Wikipedia puts it: &#8220;Psychoacoustics is the scientific study of sound perception&#8221;. While there&#8217;s a lot of theoretical research on the topic, one of the main application of psychoacoustics is lossy audio coding. One of the first codecs to make use of psychoacoustic tricks &#8212; long before MP3 was born &#8212; is the G.711 (u-law/A-law) codec. In general, lossy audio codecs attempt to reduce the bitrate by coding the audio signal with just enough accuracy to avoid the distortion being audible.</p>
<h3>What you can get away with</h3>
<p>There are many types of distortion that can be inflicted on an audio signal without causing too much audible degradation. Here are some examples below.</p>
<h4>Phase distortion</h4>
<p>The human ear is almost completely insensitive to the phase of signals. For example, we can&#8217;t distinguish between a waveform and its inverted version (the only reason loudspeakers have a red and a black connector is to avoid wiring them 180 out-of-phase with each other and getting cancellation effects). As long as the phase distortion is constant (or nearly constant) in time and that the variation in group delay across frequencies isn&#8217;t enough to cause temporal smearing, then the phase can take a lot of &#8230;</p>]]></description>
				<content:encoded><![CDATA[<h3><img class="alignleft size-full wp-image-232" title="pp_psychoacoustics" src="http://www.octasic.com/blog/wp-content/uploads/2011/06/pp_psychoacoustics.png" alt="" width="408" height="180" />Psychoacoustics</h3>
<p>As Wikipedia puts it: &#8220;Psychoacoustics is the scientific study of sound perception&#8221;. While there&#8217;s a lot of theoretical research on the topic, one of the main application of psychoacoustics is lossy audio coding. One of the first codecs to make use of psychoacoustic tricks &#8212; long before MP3 was born &#8212; is the G.711 (u-law/A-law) codec. In general, lossy audio codecs attempt to reduce the bitrate by coding the audio signal with just enough accuracy to avoid the distortion being audible.</p>
<h3>What you can get away with</h3>
<p>There are many types of distortion that can be inflicted on an audio signal without causing too much audible degradation. Here are some examples below.</p>
<h4>Phase distortion</h4>
<p>The human ear is almost completely insensitive to the phase of signals. For example, we can&#8217;t distinguish between a waveform and its inverted version (the only reason loudspeakers have a red and a black connector is to avoid wiring them 180 out-of-phase with each other and getting cancellation effects). As long as the phase distortion is constant (or nearly constant) in time and that the variation in group delay across frequencies isn&#8217;t enough to cause temporal smearing, then the phase can take a lot of abuse without anyone noticing. The audio samples below show the effect of applying an all-pass filter that completely distorts the phase, but leaves the amplitudes unmododified.</p>
<table cellpadding="0">
<tr>
<td>
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="40" height="40" id="audio01" align="middle"><br />
<embed src="/wavplayer.swf?gui=mini&#038;h=20&#038;w=300&#038;sound=/blog/wp-content/uploads/2011/06/ref2.wav&#038;" bgcolor="#ffffff" width="40" height="40" allowScriptAccess="always" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" /><br />
</object><br />
<a href="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref2.wav">original</a>
</td>
<td>
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="40" height="40" id="audio02" align="middle"><br />
<embed src="/wavplayer.swf?gui=mini&#038;h=20&#038;w=300&#038;sound=/blog/wp-content/uploads/2011/06/allpass.wav&#038;" bgcolor="#ffffff" width="40" height="40" allowScriptAccess="always" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" /><br />
</object><br />
<a href="http://www.octasic.com/blog/wp-content/uploads/2011/06/allpass.wav">modified phase</a>
</td>
</tr>
</table>
<p>In practice, few codecs really take into account the irrelevance of phase because it&#8217;s a very hard to do it right. The main class of codecs that take advantage of this in some way are the very low bitrate parametric vocoders, such as LPC10 and MELP. These codecs just transmit a set of voice synthesis parameters (typically the pitch, the gain, and the shape of the spectrum) and send them to a decoder that does not even attempt to recover the phase.</p>
<h4>Properly shaped noise</h4>
<p>Noise that resembles the signal in both spectrum and temporal envelope can be very hard to hear. This is why perceptual audio codecs can achieve good audio quality despite having a really low signal-to-noise ratio (SNR). On that topic, it&#8217;s often said that lossy codecs such as MP3 &#8220;discarding frequencies that cannot be heard&#8221;. That&#8217;s just not how it works in practice. What really happens is that instead of representing each frequency component with the equivalent of 16-bit precision (which is what CDs and most wav files use), one can selectively use a much lower precision. The result is quantization noise, which lossy codecs attempt to shape in such a way as to make it inaudible.<br />
Here&#8217;s an example that demonstrates just how much difference it can make. The same file was quantized with Vorbis at 64 kb/s, as well as with a 6-bit PCM quantizer (like a 6-bit wav file). The Vorbis file has an SNR of 16 dB, while the 6-bit PCM file has an SNR of 18 dB.</p>
<table cellpadding="0">
<tr>
<td>
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="40" height="40" id="audio03" align="middle"><br />
<embed src="/wavplayer.swf?gui=mini&#038;h=20&#038;w=300&#038;sound=/blog/wp-content/uploads/2011/06/ref2.wav&#038;" bgcolor="#ffffff" width="40" height="40" allowScriptAccess="always" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" /><br />
</object><br />
<a href="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref2.wav">original</a>
</td>
<td>
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="40" height="40" id="audio04" align="middle"><br />
<embed src="/wavplayer.swf?gui=mini&#038;h=20&#038;w=300&#038;sound=/blog/wp-content/uploads/2011/06/ref2_dec.wav&#038;" bgcolor="#ffffff" width="40" height="40" allowScriptAccess="always" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" /><br />
</object><br />
<a href="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref2_dec.wav">Vorbis (16 dB SNR)</a>
</td>
<td>
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="40" height="40" id="audio05" align="middle"><br />
<embed src="/wavplayer.swf?gui=mini&#038;h=20&#038;w=300&#038;sound=/blog/wp-content/uploads/2011/06/ref2_quant6.wav&#038;" bgcolor="#ffffff" width="40" height="40" allowScriptAccess="always" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" /><br />
</object><br />
<a href="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref2_quant6.wav">6-bit quantization (18 dB SNR)</a>
</td>
</tr>
</table>
<p>It&#8217;s pretty obvious that despite having 2 dB more noise, the Vorbis file sounds much better. The difference is that instead of having a constant, white quantization noise, the noise is shaped and sounds like <a href="http://www.octasic.com/blog/wp-content/uploads/2011/06/vorbis_noise.wav">this</a>.</p>
<h3>What you cannot get away with</h3>
<p>Despite the fact that audio signals can tolerate a lot of abuse, there are some things that you just cannot do. The following artefacts will tend to make audio sound really bad.</p>
<h4>Frame discontinuities (aka blocking artefacts)</h4>
<p>In video coding, blocking artefacts can be somewhat annoying. In audio coding/processing, blocking artefacts are just so bad that it disqualifies any algorithm that cannot avoid them. This is why audio codecs use the Modified Discrete Cosine Transform (MDCT), rather than the plain Discrete Cosine Transform (DCT) used in most video codecs. The modification in the MDCT is the addition of a smooth transition (overlap) between the two DCTs. This makes a big difference as can be seen in the example below. Both files are coded at 40 kb/s using the Opus codec, but for one of them, Opus was modified to use the DCT rather than the MDCT.</p>
<table cellpadding="0">
<tr>
<td>
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="40" height="40" id="audio06" align="middle"><br />
<embed src="/wavplayer.swf?gui=mini&#038;h=20&#038;w=300&#038;sound=/blog/wp-content/uploads/2011/06/ref2_mdct.wav&#038;" bgcolor="#ffffff" width="40" height="40" allowScriptAccess="always" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" /><br />
</object><br />
<a href="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref2_mdct.wav">MDCT</a>
</td>
<td>
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="40" height="40" id="audio07" align="middle"><br />
<embed src="/wavplayer.swf?gui=mini&#038;h=20&#038;w=300&#038;sound=/blog/wp-content/uploads/2011/06/ref2_dct.wav&#038;" bgcolor="#ffffff" width="40" height="40" allowScriptAccess="always" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" /><br />
</object><br />
<a href="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref2_dct.wav">DCT</a>
</td>
</tr>
</table>
<h4>Pre-echo</h4>
<p>Pre-echo is the main problem that transform audio codecs (e.g. those based on the MDCT) try avoiding. It happens because transform codecs have a tendency to have their quantization noise spread evenly over an entire frame. In the case of a sharp transient, it means that noise appears before the transient, like a reversed echo (thus the term pre-echo). Here&#8217;s an extreme examples coding castanets using G.722.1C, one of the worse audio codecs when it comes to pre-echo:</p>
<table cellpadding="0">
<tr>
<td>
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="40" height="40" id="audio08" align="middle"><br />
<embed src="/wavplayer.swf?gui=mini&#038;h=20&#038;w=300&#038;sound=/blog/wp-content/uploads/2011/06/ref_6.wav&#038;" bgcolor="#ffffff" width="40" height="40" allowScriptAccess="always" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" /><br />
</object><br />
<a href="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref_6.wav">original</a>
</td>
<td>
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="40" height="40" id="audio09" align="middle"><br />
<embed src="/wavplayer.swf?gui=mini&#038;h=20&#038;w=300&#038;sound=/blog/wp-content/uploads/2011/06/sir_48_6.wav&#038;" bgcolor="#ffffff" width="40" height="40" allowScriptAccess="always" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" /><br />
</object><br />
<a href="http://www.octasic.com/blog/wp-content/uploads/2011/06/sir_48_6.wav">coded</a>
</td>
</tr>
</table>
<p>The result is so bad that the castanets sound almost like maracas. There are multiple ways transform codecs use to avoid pre-echo:</p>
<ul style="padding-left: 30px;">
<li>Block switching: using a shorter transform size when transients occur</li>
<li>Temporal noise shaping (TNS): optimizing the fine temporal shape of the noise</li>
<li>Time-frequency transformations (in the Opus codec): increasing the time resolution for bands that contain transients</li>
</ul>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.octasic.com/blog/2011/06/15/applied-psychoacoustics-or-how-to-mutilate-an-audio-signal-and-get-away-with-it/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
<enclosure url="http://www.octasic.com/blog/wp-content/uploads/2011/06/allpass.wav" length="2838314" type="audio/wav" />
<enclosure url="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref2_dec.wav" length="2838446" type="audio/wav" />
<enclosure url="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref2_quant6.wav" length="2838446" type="audio/wav" />
<enclosure url="http://www.octasic.com/blog/wp-content/uploads/2011/06/vorbis_noise.wav" length="2838446" type="audio/wav" />
<enclosure url="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref2_mdct.wav" length="2838198" type="audio/wav" />
<enclosure url="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref2_dct.wav" length="2838198" type="audio/wav" />
<enclosure url="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref_6.wav" length="2838866" type="audio/wav" />
<enclosure url="http://www.octasic.com/blog/wp-content/uploads/2011/06/sir_48_6.wav" length="2835326" type="audio/wav" />
<enclosure url="http://www.octasic.com/blog/wp-content/uploads/2011/06/ref2.wav" length="1301548" type="audio/wav" />
		</item>
		<item>
		<title>Winter is over&#8230;spring is here&#8230;at least for those in telecom enterprise</title>
		<link>http://www.octasic.com/blog/2011/05/31/winter-is-over-spring-is-here-at-least-for-those-in-telecom-enterprise/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=winter-is-over-spring-is-here-at-least-for-those-in-telecom-enterprise</link>
		<comments>http://www.octasic.com/blog/2011/05/31/winter-is-over-spring-is-here-at-least-for-those-in-telecom-enterprise/#comments</comments>
		<pubDate>Tue, 31 May 2011 17:14:48 +0000</pubDate>
		<dc:creator>Andrew Faulkner</dc:creator>
				<category><![CDATA[Media Processing]]></category>
		<category><![CDATA[Opinion]]></category>
		<category><![CDATA[enterprise]]></category>
		<category><![CDATA[PBX]]></category>
		<category><![CDATA[unified communications (UC)]]></category>

		<guid isPermaLink="false">http://www.octasic.com/blog/?p=47</guid>
		<description><![CDATA[<p><img class="alignleft size-full wp-image-236" title="pp_spring" src="http://www.octasic.com/blog/wp-content/uploads/2011/05/pp_spring.png" alt="" width="408" height="180" />According to the latest <a href="http://www.infonetics.com/newsletters/Enterprise-Voice-Video-and-Unified-Communication-051911.html">Infonetics Research Newsletter</a> the Enterprise telecom market segment showed strong year-on-year growth over the same period last year (1Q10 to 1Q11). Areas singled out were Unified Communication (UC) PBx, SIP trunking session border controllers, telepresence and video conferencing –the latter two no doubt fuelled (no apology for the pun) or driven (I apologise for that one) by the extortionate raise in gas prices.</p>
<p>The good news is that this growth is consistent with other economic trends, i.e. it is not a bubble, anomaly or any other one off event. The enterprise segment unlike the consumer segment has always been in synch with macro economics. Organizations, under the steely grip of the Chief Finance Officer (CFO), rarely spend money when none is coming in (unlike your wife&#8230;or husband!). In poor economic conditions enterprises preserve cash and refrain from major upgrades or purchases &#8211; PBxs and video conferencing equipment included. However under an improving <a href="http://www.telegraph.co.uk/finance/economics/8498429/US-jobs-figures-beat-forecasts-and-invigorate-the-markets.html">economic outlook</a>, as is the case today, purse strings loosen and investments showing a good Return on Investment (ROI) get approved.</p>
<p>The UC PBx has often proven to be such an investment, leading to improvements in company efficiency and effectiveness. Older TDM PBxs, &#8230;</p>]]></description>
				<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-236" title="pp_spring" src="http://www.octasic.com/blog/wp-content/uploads/2011/05/pp_spring.png" alt="" width="408" height="180" />According to the latest <a href="http://www.infonetics.com/newsletters/Enterprise-Voice-Video-and-Unified-Communication-051911.html">Infonetics Research Newsletter</a> the Enterprise telecom market segment showed strong year-on-year growth over the same period last year (1Q10 to 1Q11). Areas singled out were Unified Communication (UC) PBx, SIP trunking session border controllers, telepresence and video conferencing –the latter two no doubt fuelled (no apology for the pun) or driven (I apologise for that one) by the extortionate raise in gas prices.</p>
<p>The good news is that this growth is consistent with other economic trends, i.e. it is not a bubble, anomaly or any other one off event. The enterprise segment unlike the consumer segment has always been in synch with macro economics. Organizations, under the steely grip of the Chief Finance Officer (CFO), rarely spend money when none is coming in (unlike your wife&#8230;or husband!). In poor economic conditions enterprises preserve cash and refrain from major upgrades or purchases &#8211; PBxs and video conferencing equipment included. However under an improving <a href="http://www.telegraph.co.uk/finance/economics/8498429/US-jobs-figures-beat-forecasts-and-invigorate-the-markets.html">economic outlook</a>, as is the case today, purse strings loosen and investments showing a good Return on Investment (ROI) get approved.</p>
<p>The UC PBx has often proven to be such an investment, leading to improvements in company efficiency and effectiveness. Older TDM PBxs, supporting voice only, are no longer suitable for today’s “always connected” office worker. The UC PBX provides a platform on which multiple real-time communication services such as instant messaging (chat), presence information, telephony (IP telephony), video conferencing, and other collaboration tools may be connected.</p>
<p>As you would expect, companies driving and benefiting from this growth are market leaders Cisco and Avaya, who between them account for more than 60% of the total UC market. Following closely behind them are the “others” that include Mitel, Siemens, NEC and Alcatel-Lucent.<br />
As more PBXs become connected it follows that the network also needs upgrading. This led to a 70% jump in enterprise session boarder controllers (SBCs). Beneficiaries of this boom include Samsung, NEC, Newport Networks as well as Alcatel-Lucent.</p>
<p>The final market segment, with a sunny horizon, is video conferencing and telepresence. According to the report this market will more than double by 2015. While Cisco is once again a Key player, others also got a jump on video, these include Radvison and Polycom among others.</p>
<p>In my opinion wholesale adoption of video in the enterprise is long overdue, and market growth still somewhat understated. I say this for a couple of reasons; firstly in the wake of recent oil prices travel is just too expensive and increasingly difficult to justify. Secondly, and I speak from personal experience, business travel is just not what it used to be. Flying, especially in economy, is uncomfortable, time consuming, and with the heightened security an invasion of privacy.<br />
So a note to all those CFO’s, do your bit&#8230;. for the economy, the environment and the quality of life of your employers -buy a UC PBx and mandate video conferencing.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.octasic.com/blog/2011/05/31/winter-is-over-spring-is-here-at-least-for-those-in-telecom-enterprise/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced
Database Caching using disk: basic

 Served from: www.octasic.com @ 2013-05-20 01:37:49 by W3 Total Cache -->