Research

Building the core kernels of multimodal agentic AI

We design optimized, efficient, high-performance vision-language models for video summarization, document understanding, and object tracking, alongside RLVR-based reasoning models for domain-specific intelligence and agent tool use.

Our state-of-the-art models

Tailor-made models for each use case and industry

Owlet-Phi-Audio

Designed for comprehensive video understanding, it excels especially in person identification, tracking, human activity recognition, and object detection

Owlet-Safety

A document understanding model designed to parse, interpret, and analyze multilingual documents with exceptional accuracy

Owlet

A family of lightweight, efficient models designed for advanced video understanding

RZN-Med

A causal language model created for medical reasoning on open-ended questions

Join Us in Shaping the Future of AI

Collaborate with our research team, or apply to join it and help drive advances in AI technology.

View Career Options

Building the core kernels of multimodal agentic AI

We aim to build AI grounded in phronesis (practical wisdom): AI that integrates context, ethics, and causal reasoning so that autonomous systems align with human values, anticipate consequences, and optimize for societal well-being.

Research Blogs

Chapter 4: Group Relative Policy Optimization (GRPO) and beyond

Chapter 3: Actor-Critic methods and deep dive into Proximal Policy Optimisation (PPO)

Chapter 2: Value Based Reinforcement Learning vs Policy Based Reinforcement Learning

Chapter 1: Reinforcement Learning for LLMs in 2025 — A Mathematical and Practical Series

Owlet-HAR-1: Building Better VLM for Human Activity Recognition

Owlet-Safety: Lightweight Model for Video Safety Monitoring

From Function Calling to Agentic Reasoning: Evaluating Tool Use in Modern LLMs

Scaling Reasoning in VLMs with Reinforcement Learning

10 Must-Have Free AI Tools For Video Editors To Skyrocket Your Social Media Engagement

AI-Powered Solutions for Law Enforcement: Enhancing Traffic Violation Detection

Owlet-Phi-2-Audio: Our latest multimodal AI model to jointly understand audio and visual signals

VideoRAG: Our new product feature to revolutionize long-video analytics

Announcing Our New Video Search Platform

Owlet: a family of lightweight models for video understanding
