Studio Robotic Control Systems


At a Glance

Facial & Body Tracking System

Vision[Ai]ry Facial Tracking and Body Tracking is the first in a suite of products that uses video analytics to automate the functions of a camera operator. Vision[Ai]ry uses AI-based facial and body detection to locate and track the position of faces and bodies within the video stream directly from the camera.

It then uses these facial and body positions to drive the pan, tilt and zoom axes of the robotic camera system to maintain the desired framing of the face/faces or body/bodies in the image. This eliminates the need for a camera operator to manually adjust for the position of the subject in the image.

Consistent Framing

Vision[Ai]ry reduces the burden on the camera operator by eliminating the need for manual corrections of the camera position to compensate for day-to-day variations in talent seating position, posture, height, and more.

Hands-free Camera Workflow

Framing settings can be saved to templates that can be automatically recalled with robotic presets to provide a hands-free camera workflow when combined with automated production control software such as OverDrive.

High-quality, Consistent Tracking

Vision[Ai]ry improves quality and consistency by automatically tracking on-air movements of the studio talent, driving the robotic camera to provide smooth, consistently well-framed images at all times, eliminating the reliance on a skilled operator.



Specifications Vision[Ai]ry
Number of robots controlled No fixed limit
Minimum PC requirements i7 (9th gen or later) -2.9 GHz, 8 cores, 8 GB RAM, Intel integrated graphics, Solid State driveBody Tracking channel requires a GPU, NVIDIA T1000 or equivalent (Note that at present, the GPU can only support one channel at a time)
Video sources Local, eg: SDI capture cardNDI (NDI|HX is not supported)BlackMagic Declink PCIe card
Number of faces that can be tracked simultaneously Up to 30
Minimum size of face in video 9% of frame
Minimum size of body in video 20% of frame
Minimum visibilility requirement 50% of face
Video formats 1080p/59.94 or 501080i/59.94 or 50720p/59.94 or 50720i/59.94 or 50
Tracking latency* <0.2s

* Delay between when the tracked subject moves outside the deadband and the camera starts to move.

Latest News and Resources

Interested in more info? Send us a note!

We’ll put you in touch with a member of our team to discuss your specific needs.