Multimodal prompting with a 44-minute movie | Gemini 1.5 Pro Demo
This is a demo of long-context understanding, an experimental feature in our newest model, Gemini 1.5 Pro. It uses a 44-minute silent Buster Keaton movie, Sherlock Jr., and a series of multimodal prompts.
This demo is a continuous recording of a live model interaction. Sequences have been shortened with response times shown.
Token count details: The input video (696,161 tokens) and image (256 tokens) total 696,417 tokens. The text inputs add further tokens to the prompt, yielding the 696,538-token total shown in the interface.
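As a quick check of the figures above, the arithmetic can be sketched in a few lines of Python. The variable names are illustrative only, and the text-token count is inferred by subtraction; it is not stated in the description itself.

```python
# Token counts quoted in the demo description.
VIDEO_TOKENS = 696_161
IMAGE_TOKENS = 256
TOTAL_TOKENS_SHOWN = 696_538  # total displayed in the interface

# Video plus image, as stated in the description.
media_tokens = VIDEO_TOKENS + IMAGE_TOKENS

# Assumption: everything beyond the media tokens comes from the text prompts.
text_tokens = TOTAL_TOKENS_SHOWN - media_tokens

print(f"media tokens: {media_tokens:,}")
print(f"text tokens (inferred): {text_tokens:,}")
```

Under that assumption, the text prompts account for only a small remainder of the total; nearly all of the context is taken up by the video.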