OpenAI, LLMs, WebRTC, voice bots and Programmable Video
Learn about WebRTC LLM and its applications. Discover how this technology can improve real-time communication using conversational AI.
Read MoreSome believe WebRTC isn’t ready. I think it is ready. But when will WebRTC 1.0 be available?
Ready or not, WebRTC is here. The thing is, we still don’t have a closed standard specification we can all print and take on a plane to read for our enjoyment. There are drafts - but nothing that is final.
And once final, does it mean that it is available?
There are 3 parts that needs to be addressed to answer this question. I’ll deal with only two of them (skipping the IETF one):
Want to learn more about WebRTC, the various components in its specification and what compute power you need for each WebRTC server? Try out my free video course:
Want to learn more about WebRTC, the various components in its specification and what compute power you need for each WebRTC server? Try out my free video course:
WebRTC as a standard is built out of two components:
Most of the industry is already viewing WebRTC as a done deal - so much so that the IETF already has an RFC for SIP over WebSocket. The only reason to have such an RFC is to be able to use SIP inside a browser, and the only way to use SIP inside a browser with media being sent or received would be by way of WebRTC. The people working at the IETF were so certain WebRTC will get an RFC of its own in 2014 already (5 years ago!).
Each of these organizations has its own set of rules, policies, governance and flow.
I’ve tried to keep the standardization of WebRTC at arm’s length. In the past I’ve been part of standardization processes related to H.323 and 3G-324M, going to ITU-T and 3GPP standardization meetings as well as acting as a co-chair of the 3G-324M activity group at the IMTC (dealing with interoperability). It is a tedious work that combines technology with politics. As fun as it is (at times at least), dealing with it as an employee of a company is different than doing it as a consultant. The value for me just wasn’t there.
For vendors? If you want to take a driver’s seat at this, and decide what gets more attention, then you should invest time in it.
But where are we with WebRTC then?
I’ve asked Dominique Hazael-Massieux about WebRTC’s status. He works as a W3C Staff dealing with WebRTC. Here’s what I got -
When it comes to W3C, where the browser WebRTC APIs are being defined, WebRTC is considered to be at the CR stage.
CR means a Candidate Recommendation. We’ve moved from a Working Draft (WD) towards a Candidate Recommendation.
Next up would be PR - Proposed Recommendation, and from there, a Recommendation.
How do we move to the next step?
That first one is “easy”. Get the people writing the spec into a room. Have them agree. Then have someone write down the agreement on “paper”. Get everyone to read it. And agree again. Rinse and repeat. It’s never easy.
That second one of implementing in browsers? That’s also not easy. They have other things on their minds as well. And WebRTC is pretty darn complex to implement. But we’re getting there.
That third one of interoperability testing? With a test suite. That tests for the various features? This is downright suicidal. And daunting.
All that work needs to be done for “free”. There’s no direct money to be made out of it. But lots of hours needs to be spent by many people to get it done. We’re getting there, but we’re not there yet.
And then there are the browser implementations.
The specification is as good as its implementations. People always complain when I suggest following the Chrome behavior in WebRTC as opposed to implementing against the specification. That’s where theory and expectations meets reality.
At the end of the day, your service will need to:
In the first case, Chrome wins on market share; Microsoft Edge will be migrating to Chromium. And for most use cases, Chrome is the first browser to target anyway.
In the second case, if you are using the code in webrtc.org for your app, then you are effectively basing your app on Chrome’s WebRTC implementation.
Better go with what’s available now than what will be ready some time in the future.
In the past, the changes we’ve seen in browser implementations of WebRTC revolved a lot around media optimizations and interoperability across browsers. What we are seeing now a lot more is changes in the API layer, where browsers are shifting towards the WebRTC 1.0 specification. This is necessary because:
These changes mean one sad thing though. You can be certain in one thing - during 2019, WebRTC implementations in browsers is going to break existing apps multiple times. This is due to the changes taking place. We are seeing migration from Plan B towards Unified Plan, modifications to the connection state machine, and an experimental implementation of mDNS. There’s more that I probably forgot and more ahead of us still.
The only certainty is that nothing is certain. You’ll need to continue investing in aligning with the browser implementations with each and every browser version release.
The current intent is to be able to get to the PR stage for WebRTC somewhere in Q3 2019. Will it be postponed further? I don’t really know.
Interestingly, work has started in parallel about WebRTC NV - what comes next. I’ve covered the WebAssembly in WebRTC part of it in the past.
Want to learn more about WebRTC, the various components in its specification and what compute power you need for each WebRTC server? Try out my free video course:
Learn about WebRTC LLM and its applications. Discover how this technology can improve real-time communication using conversational AI.
Read MoreGet your copy of my ebook on the top 7 video quality metrics and KPIs in WebRTC (below).
Read More