Video Platforms
What it means (definitions and thresholds)
When we say latency for a video platform we mean end-to-end (E2E) latency: the time from the capture timestamp (camera or encoder) to the first visible frame in the viewer's player. The contributing components are capture, encode, transport (ingest), server-side processing (decode/transcode/package), CDN/egress, and player buffering.
Practical latency thresholds engineers use to choose an architecture:
- Ultra-low / Interactive <500 ms — required for real-time interactions like gaming, remote control, or live bidding. Typical tech: WebRTC or SRT contribution + WebRTC SFU egress.
- Low-latency 500 ms – 3 s — live Q&A, low-delay sports, auctions. Typical tech: SRT contribution with LL-HLS / CMAF distribution or SRT-to-WebRTC edge.
- Near-real-time 3 – 10 s — mainstream low-latency live where caching and CDN efficiency are needed. Typical tech: LL-HLS with 2 s segments and 200 ms parts; DASH CMAF variants.
- Classic live >10 s — broadcast-style HLS/DASH with large CDN caches and simpler packaging.
About SRT: SRT is a UDP-based contribution/backhaul protocol optimized for packet loss and jitter. It uses ARQ (retransmission) inside a jitter buffer controlled by the 'latency' parameter (milliseconds). Typical production latency settings for SRT range from 120 ms (clean networks) to 2000 ms (very lossy links). SRT supports encryption (AES), configurable latency, and connection identifiers; it is not a CDN delivery protocol but a strong contribution/backhaul option.
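A common way to size that latency parameter is from measured round-trip time: start near 3–4× RTT so ARQ has time for several retransmission attempts, widen further on lossy links, and clamp to the production range above. A minimal sizing sketch (the multiplier and loss threshold are rules of thumb to tune against your own links, not SRT mandates):

```python
def srt_latency_ms(rtt_ms: float, loss_pct: float) -> int:
    """Suggest an SRT 'latency' value from measured RTT and packet loss.

    Rule of thumb (an assumption, not part of the SRT spec): start at
    ~4x RTT so ARQ has time for several retransmission rounds, widen
    the buffer on lossy links, and clamp to the 120-2000 ms production
    range discussed above.
    """
    base = 4 * rtt_ms
    if loss_pct > 1.0:          # >1% loss: allow extra retransmission rounds
        base *= 1.5
    return int(min(max(base, 120), 2000))

# Clean metro link: 20 ms RTT, negligible loss -> clamped to the 120 ms floor
print(srt_latency_ms(20, 0.1))    # 120
# Lossy long-haul link: 180 ms RTT, 2% loss
print(srt_latency_ms(180, 2.0))   # 1080
```

Whatever starting value you compute, validate it against the SRT retransmit statistics on a representative test stream before locking it in.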
Decision guide
Pick an architecture by answering four questions: interactivity requirement, audience size, device constraints, and network quality. Below are decision rules and direct mapping to product capabilities.
- If you need sub-second E2E latency and most viewers are on browsers or mobile apps:
- Use WebRTC for distribution or SRT-to-WebRTC edge. WebRTC gives the best sub-second results at moderate scale but requires an SFU or scalable real-time infrastructure.
- Map to a video API that includes SFU/egress capabilities; see product integration notes at callaba.io/products/video-api.
- If you need 0.5–3 s with global scale and browser/legacy compatibility:
- Ingest via SRT for reliable contribution, transcode to an ABR ladder, package as LL-HLS (CMAF with 200 ms parts), and serve via CDN.
- Use a video platform that supports SRT ingest and LL-HLS packaging; see callaba.io/products/low-latency and the LL-HLS docs at callaba.io/docs/ll-hls.
- If you need very large scale and can tolerate 6–30 s latency:
- Classic HLS/DASH with 6 s segments or larger is simplest, best for heavy caching, and scales efficiently through generic CDNs. Use this when interactivity is not required.
- Platform mapping: use the standard video platform product page for global distribution at callaba.io/products/video-platforms.
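The decision rules above can be condensed into a small helper that maps a target latency to an architecture class; the boundaries mirror the thresholds from the previous section (the function and its labels are illustrative, not a product API):

```python
def pick_architecture(target_latency_s: float) -> str:
    """Map a target E2E latency to the architecture classes described above."""
    if target_latency_s < 0.5:
        return "WebRTC distribution (SFU), optionally SRT-to-WebRTC edge"
    if target_latency_s <= 3:
        return "SRT contribution -> ABR transcode -> LL-HLS (CMAF) via CDN"
    if target_latency_s <= 10:
        return "LL-HLS / CMAF DASH with 2 s segments and 200 ms parts"
    return "Classic HLS/DASH, 6 s+ segments, heavy CDN caching"

print(pick_architecture(0.4))   # interactive class
print(pick_architecture(2.0))   # low-latency live class
```

Audience size, device constraints, and network quality still gate the final choice; treat the latency mapping as the first filter, not the whole decision.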
Latency budget / architecture budget
Latency budgets make trade-offs explicit. Below are three example budgets and how to allocate milliseconds across components. Measure against these on your test streams.
Target A — Sub-second interactive (400 ms)
- Capture + camera pipeline: 30–80 ms
- Encode (hardware/low-latency settings): 40–80 ms
- Transport (SRT/WebRTC network transit + jitter buffer): 80–120 ms
- Server processing (SFU or router): 40–70 ms
- Player decode & render buffer: 40–60 ms
- Margin for variability: 30–50 ms
Target B — Low-latency live (1.5 s)
- Capture + encode: 50–150 ms
- SRT ingest jitter buffer: 200–600 ms (set 'latency' param accordingly)
- Transcode/packager: 150–300 ms
- CDN edge & network to viewer: 200–400 ms
- Player buffer: 200–400 ms
- Operational margin: 100–200 ms
Target C — Near-real-time / LL-HLS (3 s)
- Capture + encode: 50–200 ms
- SRT/ingest latency: 300–800 ms
- Packaging (2 s segments + 200 ms parts): 700–900 ms (depends on part alignment)
- CDN distribution & network: 400–800 ms
- Player buffer and playback: 200–400 ms
How to use this: choose the target latency class, then configure each component to be below its allocated ms. Always leave 10–20% margin for spikes and transient packet loss. Instrument each stage so you can measure where time is spent.
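One way to make this measurable is to sum per-stage timings from an instrumented test stream and check them against the target with the 10–20% margin applied; a sketch using the Target B class (the measured numbers are example values, not benchmarks):

```python
TARGET_B_MS = 1500  # "Low-latency live" class from the budget above

# Measured per-stage timings from an instrumented test stream (example numbers)
measured = {
    "capture_encode": 120,
    "srt_jitter_buffer": 400,
    "transcode_package": 250,
    "cdn_network": 300,
    "player_buffer": 300,
}

def within_budget(stages: dict, target_ms: int, margin: float = 0.15) -> bool:
    """True if the summed stage timings leave the recommended 10-20% headroom."""
    return sum(stages.values()) <= target_ms * (1 - margin)

total = sum(measured.values())
print(total, within_budget(measured, TARGET_B_MS))  # 1370 False: under target, but <10% headroom
```

Note the example fails the check even though the raw total fits under 1.5 s: the margin rule exists precisely to catch configurations with no headroom for spikes.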
Practical recipes
Below are production-ready recipes. Each recipe finishes with concrete configuration values to copy into encoders/servers.
Recipe 1 — Low-latency live event (target 1.5–3 s) using SRT + LL-HLS
- Contribution
- Camera/encoder sends to an SRT ingest point. Use AES encryption for transport if required.
- Encoder targets: H.264 baseline/main/high as appropriate; CFR (constant frame rate); keyframe interval = 2 s (GOP = 2 s).
- Transcoding
- Transcode to an ABR ladder; example ladder (start points):
- 1080p30: 4500–6000 kbps
- 720p30: 2500–3500 kbps
- 480p30: 1000–1600 kbps
- 360p30: 600–900 kbps
- Packaging
- Package to CMAF fMP4 with LL-HLS: segment target-duration = 2 s, part duration = 200 ms.
- Ensure keyframes align to part boundaries where possible (keyframe every 2 s or 1 s depending on part alignment strategy).
- CDN & playback
- Use a CDN that supports fragment caching and request coalescing. Set short TTL at origin (sub-second for live segments) and use edge keepalive.
- Player: use an LL-HLS-capable player and set an initial buffer target of 2–3 s.
Concrete config targets for this recipe:
- SRT ingest: latency = 300 ms for clean networks, increase to 600–1000 ms if packet loss & jitter are present. Enable AES-128/256 encryption if needed. Use a dedicated UDP port & verify firewall rules.
- Encoder: keyframe interval = 2 s, profile = high, level = 4.1 (1080p30) or 4.2 (1080p60). Audio AAC 48 kHz 64–128 kbps.
- Packager: LL-HLS with EXT-X-PART-INF PART-TARGET = 0.2, EXT-X-MAP present, playlist updates on part completion.
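Keyframe, segment, and part alignment is easy to get wrong; a quick consistency check for the Recipe 1 numbers (working in integer milliseconds to avoid float rounding):

```python
def check_ll_hls_alignment(fps: int, gop_ms: int, segment_ms: int, part_ms: int):
    """Verify the encoder GOP lines up with LL-HLS segment and part boundaries.

    Returns (frames per GOP, parts per segment) if everything divides cleanly.
    """
    assert (fps * gop_ms) % 1000 == 0, "GOP must be a whole number of frames"
    assert segment_ms % gop_ms == 0, "segments must contain whole GOPs"
    assert segment_ms % part_ms == 0, "segment must split into whole parts"
    return fps * gop_ms // 1000, segment_ms // part_ms

# Recipe 1 numbers: 30 fps, 2 s GOP, 2 s segments, 200 ms parts
print(check_ll_hls_alignment(30, 2000, 2000, 200))  # (60, 10)
```

Run the same check against every rung of your ABR ladder; a single mis-set encoder breaks part alignment for that rendition only, which is hard to spot by eye.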
Recipe 2 — Interactive webinar (target sub-second to 500 ms) using WebRTC with SRT contribution
- Contribution: remote presenters use SRT for contribution if their device or encoder cannot do WebRTC. Ingest to a low-latency edge that can republish as WebRTC.
- Realtime fabric: use an SFU for media distribution. The SFU should accept SRT ingest or WebRTC ingest and forward RTP to viewers.
- Fallback: for viewers who need lower CPU usage or older browsers, provide a 2–3 s LL-HLS fallback.
Concrete config targets for this recipe:
- SRT ingest: latency = 120–300 ms (aggressive). Use small encoder buffer and low-latency presets on hardware encoders.
- SFU: enable simulcast or SVC to reduce CPU on the SFU and provide multiple quality layers to the viewer.
- Player: WebRTC client with jitter buffer target 50–150 ms; enable forward error correction at the RTP layer if available.
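With simulcast enabled, the SFU's job reduces to forwarding the highest layer each viewer's downlink can sustain; a simplified selection sketch (layer bitrates borrowed from the Recipe 1 ladder; the 20% headroom factor is an assumption, not an SFU default):

```python
# Simulcast layers (kbps), ordered high -> low, roughly the Recipe 1 ABR ladder
LAYERS = [("1080p30", 4500), ("720p30", 2500), ("480p30", 1000), ("360p30", 600)]

def pick_layer(downlink_kbps: float, headroom: float = 0.8) -> str:
    """Pick the highest simulcast layer that fits the viewer's estimated
    downlink, keeping ~20% headroom for audio, RTCP, and retransmits."""
    budget = downlink_kbps * headroom
    for name, kbps in LAYERS:
        if kbps <= budget:
            return name
    return LAYERS[-1][0]                # always serve the lowest layer as a floor

print(pick_layer(6000))   # 1080p30
print(pick_layer(1500))   # 480p30
```

Real SFUs also factor in recent loss and RTT trends before switching layers; the bandwidth estimate alone is too noisy to act on instantly.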
Recipe 3 — Global broadcast with social restream and HLS fallback
- Ingest via SRT from production site to origin cluster.
- Transcode to ABR ladder and simultaneously create RTMP outputs for social endpoints (YouTube/Twitch) and LL-HLS for site visitors.
- Use adaptive distribution: LL-HLS for site visitors and RTMP to social endpoints; maintain a classic HLS fallback (6 s segments) for high-latency tolerant viewers.
Concrete config targets for this recipe:
- SRT latency = 500–1000 ms when you need higher resilience on long-haul links.
- Encoder: use CBR on hardware encoders for RTMP/social outputs, keyframe = 2 s; for LL-HLS ABR use constrained VBR with target bitrates listed in Recipe 1.
- Packaging: provide both LL-HLS (2 s / 200 ms parts) and classic HLS (6 s segments) streams from the packager.
Practical configuration targets
These are production-ready numeric targets that you should validate in your environment.
- Encoder / capture
- Keyframe interval (GOP): 2 s (e.g., 60 frames for 30 fps, 120 frames for 60 fps).
- Frame rate: keep capture/framerate constant; use CFR to avoid frame drops in packaging.
- MTU: set UDP payload <= 1350 bytes to avoid fragmentation on typical internet paths.
- SRT
- Latency parameter: 120–300 ms for low-latency links; 500–1000+ ms for lossy long-haul links.
- Retransmission: rely on SRT ARQ; increase latency to allow more retransmissions if packet loss > 1%.
- Ports: ensure firewall allows UDP on chosen ports and that NAT traversal is tested.
- Packaging
- LL-HLS part size: 200 ms (0.2 s) typical; ensure packager writes parts and updates playlists on part boundary.
- Segment length for LL-HLS: 2 s recommended; classic HLS: 6 s.
- Bitrates and audio
- 1080p30: 4.5–6 Mbps; 720p30: 2.5–3.5 Mbps; 480p30: 1–1.6 Mbps; 360p30: 600–900 kbps.
- Audio: AAC-LC, 48 kHz, 64–128 kbps stereo for music/content requiring fidelity.
- Player
- Initial buffer for LL-HLS: 2–3 s; for WebRTC: 50–150 ms.
- ABR switching: allow client to switch conservatively under packet loss; aggressive switching increases rebuffering risk.
Limitations and trade-offs
Every performance optimization has costs. Be explicit about them.
- Lower latency vs caching: Sub-second and 1–3 s latencies reduce the benefit of CDN caching and increase origin load. You may need more origin capacity or regional packagers.
- Cost vs CPU: WebRTC SFUs and real-time transcoders cost more CPU per concurrent viewer than chunked HTTP-based delivery; budget accordingly.
- Network variability: Global viewers have variable RTT and packet loss; SRT helps contributors but viewer-side latency depends on geography and last-mile.
- Device compatibility: WebRTC gives lowest latency but older devices/browsers need fallbacks such as LL-HLS; support both if you have mixed clients.
Common mistakes and fixes
These are the most common production mistakes and how to fix them quickly.
- Incorrect keyframe interval
- Symptom: players wait extra time for a clean segment, increasing latency or producing blank frames.
- Fix: set encoder keyframe interval to 2 s and align to packaging boundaries. For LL-HLS, align keyframes to parts or 1/2 segment boundaries.
- Too small SRT latency setting on lossy links
- Symptom: frequent retransmissions and stalls.
- Fix: increase SRT latency from 120 ms to 600–1000 ms depending on measured packet loss/jitter. Monitor SRT stats such as packet loss and retransmissions.
- MTU fragmentation
- Symptom: frequent packet loss or high retransmits on UDP.
- Fix: reduce UDP payload to 1350 bytes and test across representative networks.
- Using VBR with high variability
- Symptom: large bitrate spikes lead to buffer underruns and packet loss.
- Fix: use constrained VBR or CBR for live contribution; use bitrate shaping at encoder.
- No instrumentation
- Symptom: latency spikes with no root cause visibility.
- Fix: instrument each stage with metrics: encoder encode time, SRT RTT/packet loss, transcode time, packaging time, CDN response times and player buffering stats.
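A minimal shape for that instrumentation is a checkpoint per stage with derived per-leg deltas (stage names are illustrative; in production each component would emit its timestamps to your metrics pipeline rather than a single process):

```python
import time

class StageTimer:
    """Record wall-clock checkpoints per stream stage; report deltas in ms."""
    def __init__(self):
        self.marks = []

    def mark(self, stage: str):
        self.marks.append((stage, time.monotonic()))

    def report(self) -> dict:
        deltas = {}
        for (prev, t0), (cur, t1) in zip(self.marks, self.marks[1:]):
            deltas[f"{prev}->{cur}"] = (t1 - t0) * 1000.0
        return deltas

t = StageTimer()
for stage in ("capture", "encode", "srt_ingest", "transcode", "package", "player"):
    t.mark(stage)        # in production, each component emits its own timestamp
for leg, ms in t.report().items():
    print(f"{leg}: {ms:.1f} ms")
```

The per-leg breakdown is what turns "latency went up" into "the SRT jitter buffer grew by 300 ms", which is the fix-finding step this section is about.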
Rollout checklist
Use this checklist for deploying a low-latency stream to production.
- Define your latency target (sub-500 ms / 1–3 s / 3–10 s).
- Choose architecture: WebRTC, SRT+LL-HLS, or classic HLS (see decision guide).
- Prepare a staging environment with representative network constraints (RTT, packet loss, jitter).
- Configure encoders: keyframe interval = 2 s, CFR, MTU ≤ 1350, bitrate ladder according to recipes.
- Configure SRT ingest: set latency and enable encryption if required; validate firewall/NAT rules.
- Deploy packagers: LL-HLS with 2 s segments and 200 ms parts; confirm playlists update per part in staging. See packaging notes at callaba.io/docs/ll-hls.
- Run load tests that simulate expected concurrency and geographies; measure origin load and CDN edge behavior.
- Instrument and set alerts: high packet loss, high SRT retransmits, transcoder CPU over 80%, player rebuffer events > threshold.
- Publish fallback streams: classic HLS and RTMP outputs for social platforms if relevant.
- Run a soft launch with a subset of viewers before full production.
Example architectures
Three common production architectures with component details and mapping to platform pages.
1) Small event, regional audience (low operational complexity)
- Camera → hardware encoder → SRT to origin cluster → single-transcoder node → packager (LL-HLS) → CDN edge → viewers (LL-HLS players)
- Map to: SRT ingest + simple transcoder product at callaba.io/products/video-platforms.
2) Interactive webinar (moderate scale, sub-second to low-latency)
- Presenters: camera/encoder → SRT ingest to real-time edge
- Real-time edge: SRT-to-WebRTC gateway → SFU (WebRTC) for viewers; also generate LL-HLS copy for non-WebRTC clients
- Map to: video API and SFU features at callaba.io/products/video-api and docs at callaba.io/docs/srt-guide.
3) Global broadcast at scale (high concurrency, hybrid latency)
- Camera → SRT to multi-region ingest → regional packagers (CMAF/LL-HLS) → origin cluster → CDN configured for fragment caching → global viewers
- Simultaneous outputs: RTMP to social + LL-HLS for site + classic HLS fallback.
- Map to: low-latency product and CDN integration details at callaba.io/products/low-latency and packaging docs at callaba.io/docs/encoder-setup.
Troubleshooting quick wins
When latency or quality is worse than expected, try these targeted fixes in this order (fastest wins first).
- Check encoder health: confirm CFR, keyframe interval = 2 s, CPU on encoder < 80%.
- Increase SRT latency by 200–500 ms and observe retransmit rates; if retransmits drop and playback stabilizes, leave the higher latency or investigate network path.
- Reduce bitrate of the top ABR layer by 10–20% to hit stable throughput and reduce packet loss.
- Lower UDP MTU to 1350 bytes and re-test for fragmentation issues.
- If viewers in a region suffer higher latency, add a regional packager/origin near that region to reduce last-mile delay.
Next step
Map your use case to a product and start a controlled test. If you want to implement SRT contribution with LL-HLS distribution, try the low-latency product pages and docs:
- Start with the product overview at callaba.io/products/video-platforms to understand global deployment options.
- For low-latency specific capabilities, review callaba.io/products/low-latency and the LL-HLS guide at callaba.io/docs/ll-hls.
- If you plan to integrate programmatically, see the API and SFU details at callaba.io/products/video-api and the SRT setup docs at callaba.io/docs/srt-guide.
Ready to run a proof-of-concept? Start a trial or request a technical consult to map your latency target, audience size and budget to a concrete architecture and a staged rollout plan: callaba.io/get-started.

