Stay organized with collections
Save and categorize content based on your preferences.
This page provides the basic requirements for Google Meet Media API client video
codecs to ensure a good user experience.
Functional requirements
This section specifies the requirements for the codec types and features.
Feature
Requirement
Profile
AV1
VP9
VP8
Main
Profile 0
N/A
Scalability
Decoder must support temporal scalability, spatial scalability, or both if required by the
coding specification for given codec type and profile.
Raw video format
Codec must be capable of processing odd resolutions (such as when frame width or height are not
multiple of 2; for example, 133 × 141). For subsampled chroma formats, the number of chroma
samples per dimension must be rounded up when derived from the number of luma samples.
Cropping, padding, or scaling isn't allowed. Resolution of the output frame must match
the resolution of the input frame.
Operational requirements
This section specifies requirements for various conditions the video codec is
expected to operate in. Violation of these requirements may restrict the usage
of the codec to a subset of scenarios but doesn't necessarily prevent it from
being used. For example, if the maximum number of codec instances is less than
required, the codec can still be used together with codecs of other types or
implementations.
Parameter
Requirement
Minimum resolution, pixels
≤ 128 × 128
Maximum resolution, pixels
≥ 2880 × 1800
Minimum frame rate, FPS
≤ 1
Maximum frame rate, FPS
≥ 30
Minimum bitrate, kbps
≤ 30
Maximum bitrate, kbps
≥ 5000
Maximum decoder instances
≥ 3
Maximum total decode throughput, pixels per second
≥ 3 × 2880 × 1880 × 30
Performance requirements
This section specifies the requirements on codec performance. Violation of these
requirements may significantly impact the user experience and will almost
certainly block the usage of the codec.
Parameter
Requirement
Maximum inter frame processing time, seconds
≤ 1 / max(30, encode_target_frame_rate_fps)
Maximum key frame processing time, seconds
≤ 2 / max(30, encode_target_frame_rate_fps)
The frame processing time is the time difference between two events: 1) codec
delivers the frame and 2) codec receives the frame. The average frame processing
time is calculated over a sliding window of at least 10 seconds duration
holding at least 10 frames. The maximum frame processing time is the highest
value among all individual frame processing time values.
The throughput is calculated as the number of processed frames, including
dropped ones, divided by time delta between the two events: 1) codec delivers
the last frame and 2) codec delivers the first frame.
Test cases
The following are test cases you can use:
Tests
Decode throughput
Decode up to three 2880 × 1880 × 30 streams simultaneously and verify that decoders don't
violate the performance requirements.
Decoder conformance
Decode a set of pre-encoded bitstreams. The output of the test decoder must match the output
of a reference decoder (libvpx for VP8 & VP9, libaom for AV1).
Temporal scalability
Decode video with multiple temporal layers (2 and 3). The output of the test decoder must
match the output of the reference decoder.
Spatial scalability
Decode video with multiple spatial layers (2 and 3). The output of the test decoder must match
the output of the reference decoder.
[null,null,["Last updated 2025-08-28 UTC."],[],[],null,["# Video codec requirements for Meet Media API\n\n| **Developer Preview:** Available as part of the [Google Workspace Developer Preview Program](https://developers.google.com/workspace/preview), which grants early access to certain features. \n|\n| **To use the Meet Media API to access real-time media from a conference, the Google Cloud project, OAuth principal, and all participants in the conference must be enrolled in the Developer Preview Program.**\n\nThis page provides the basic requirements for Google Meet Media API client video\ncodecs to ensure a good user experience.\n\nFunctional requirements\n-----------------------\n\nThis section specifies the requirements for the codec types and features.\n\n| Feature | Requirement |||\n|------------------|--------|-------------|-----|\n| Profile | AV1 | VP9 | VP8 |\n| Profile | `Main` | `Profile 0` | N/A |\n| Scalability | Decoder must support temporal scalability, spatial scalability, or both if required by the coding specification for given codec type and profile. |||\n| Raw video format | Codec must be capable of processing odd resolutions (such as when frame width or height are not multiple of 2; for example, 133 × 141). For subsampled chroma formats, the number of chroma samples per dimension must be rounded up when derived from the number of luma samples. Cropping, padding, or scaling isn't allowed. Resolution of the output frame must match the resolution of the input frame. |||\n\nOperational requirements\n------------------------\n\nThis section specifies requirements for various conditions the video codec is\nexpected to operate in. Violation of these requirements may restrict the usage\nof the codec to a subset of scenarios but doesn't necessarily prevent it from\nbeing used. For example, if the maximum number of codec instances is less than\nrequired, the codec can still be used together with codecs of other types or\nimplementations.\n\n| Parameter | Requirement |\n|----------------------------------------------------|---------------|---|---|---|\n| Minimum resolution, pixels | ≤ 128 × 128 |\n| Maximum resolution, pixels | ≥ 2880 × 1800 |\n| Minimum frame rate, FPS | ≤ 1 |\n| Maximum frame rate, FPS | ≥ 30 |\n| Minimum bitrate, kbps | ≤ 30 |\n| Maximum bitrate, kbps | ≥ 5000 |\n| Maximum decoder instances | ≥ 3 |\n| Maximum total decode throughput, pixels per second | ≥ 3 × 2880 × 1880 × 30 ||||\n\nPerformance requirements\n------------------------\n\nThis section specifies the requirements on codec performance. Violation of these\nrequirements may significantly impact the user experience and will almost\ncertainly block the usage of the codec.\n\n| Parameter | Requirement |\n|----------------------------------------------|-----------------------------------------------|\n| Maximum inter frame processing time, seconds | ≤ `1 / max(30, encode_target_frame_rate_fps)` |\n| Maximum key frame processing time, seconds | ≤ `2 / max(30, encode_target_frame_rate_fps)` |\n\nThe frame processing time is the time difference between two events: 1) codec\ndelivers the frame and 2) codec receives the frame. The average frame processing\ntime is calculated over a sliding window of at least **10 seconds** duration\nholding at least **10 frames**. The maximum frame processing time is the highest\nvalue among all individual frame processing time values.\n\nThe throughput is calculated as the number of processed frames, including\ndropped ones, divided by time delta between the two events: 1) codec delivers\nthe last frame and 2) codec delivers the first frame.\n\nTest cases\n----------\n\nThe following are test cases you can use:\n\n| Tests |\n|----------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------|\n| Decode throughput | Decode up to three 2880 × 1880 × 30 streams simultaneously and verify that decoders don't violate the performance requirements. |\n| Decoder conformance | Decode a set of pre-encoded bitstreams. The output of the test decoder must match the output of a reference decoder (`libvpx` for VP8 \\& VP9, `libaom` for AV1). |\n| Temporal scalability | Decode video with multiple temporal layers (2 and 3). The output of the test decoder must match the output of the reference decoder. |\n| Spatial scalability | Decode video with multiple spatial layers (2 and 3). The output of the test decoder must match the output of the reference decoder. |\n\nRelated topics\n--------------\n\n- [Get started with Meet Media API](/workspace/meet/media-api/guides/get-started)"]]