Video Understanding Computer

Memories.ai Recognized as a Leading Video Understanding Model for Video Caption

Memories.ai, the pioneering AI company founded by former Meta Reality Labs researchers, today announced it has been recognized as a leading video understanding model for video caption by the ...

Yahoo Finance

Ai2 Releases Molmo 2: State-of-the-Art Open Multimodal Family for Video and Multi-Image Understanding

New open models unlock deep video comprehension with novel features like video tracking and multi-image reasoning, accelerating the science of AI into a new generation of multimodal intelligence.

9to5Mac

Apple trained a large language model to efficiently understand long-form video

Apple researchers have developed an adapted version of the SlowFast-LLaVA model that beats larger models at long-form video analysis and understanding. Here’s what that means. Very basically, when an ...

Wired

This Technique Can Make It Easier for AI to Understand Videos

Whether it’s dubious viral memes, gaffe-prone presidential debates, or surreal TikTok remixes, you could spend the rest of your life trying to watch all the video footage posted on YouTube in a single ...

Science Daily

Computer scientists develop new tool that generates videos from themed text

A global team of computer scientists have developed ''Write-A-Video'', a new tool that generates videos from themed text. Using words and text editing, the tool automatically determines which scenes ...

TV Technology

TwelveLabs to Bring Its State-of-the-Art Video AI Models to Amazon Bedrock

LAS VEGAS--Amazon Web Services (AWS) and TwelveLabs have announced that TwelveLabs' state-of-the-art multimodal foundation models, Marengo and Pegasus, will soon be available in Amazon Bedrock. The ...

Business Wire

Ai2 Releases Molmo 2: State-of-the-Art Open Multimodal Family for Video and Multi-Image Understanding

SEATTLE--(BUSINESS WIRE)--Ai2 (The Allen Institute for AI) today announced Molmo 2, a state-of-the-art open multimodal model suite capable of precise spatial and temporal understanding of video, image ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results