Skip to main content

The Dawn of Interactive Video Technology

In an age where digital experiences are evolving faster than ever, Odyssey, a London-based AI lab, has launched a groundbreaking research preview of a model that transforms video into interactive worlds. This innovative technology aims to change the landscape of entertainment and engagement by allowing users to interact with videos in real-time. The potential here goes beyond mere entertainment; it represents a significant leap toward an entirely new medium of storytelling.

The Odyssey team initially focused on developing world models for film and game production but stumbled upon a unique opportunity to create a new form of interactive video. This model generates videos that respond to inputs—whether from a keyboard, phone, or even voice commands—creating a dynamic experience that feels like stepping into a digital landscape that is alive and reactive. As the Odyssey team describes it, this technology is reminiscent of an early version of the Holodeck, a staple of science fiction.

How Does Odyssey’s AI Model Work?

At the heart of this technology lies a concept known as a world model. Unlike traditional video rendering methods that produce entire clips at once, world models generate videos frame-by-frame, predicting the next frame based on the current state and user inputs. This process is akin to how large language models predict the next word in a sentence but involves the more complex task of generating high-resolution video frames.

Odyssey explains, “A world model is, at its core, an action-conditioned dynamics model.” With every user interaction, the model considers the current state, the action taken, and the historical context to generate the next video frame, resulting in an organic and unpredictable experience. This is a departure from conventional programming methods that rely on pre-determined logic.

  • Real-time frame generation every 40 milliseconds.
  • Interaction capabilities via various devices.
  • Dynamic, responsive video experiences.

Addressing Challenges in AI-Generated Video

Building a stable AI-generated interactive video system presents numerous challenges. One significant hurdle is the drift phenomenon, where minor errors in frame generation can compound over time, leading to instability. To combat this, Odyssey employs a narrow distribution model, which involves pre-training the AI on general video footage before fine-tuning it on a smaller array of specific environments. This approach may sacrifice some variety, but it enhances overall stability, ensuring a coherent experience for users.

Odyssey is optimistic about the future, claiming they are making rapid progress on their next-generation model, which promises a richer range of pixels, dynamics, and actions.

The Economics of Interactive Video

While the technology is groundbreaking, the cost of running the AI infrastructure is notable. Currently, it costs between £0.80-£1.60 per user-hour to power the experience, utilizing high-performance clusters of H100 GPUs across the US and EU. This price tag might seem steep for streaming video, yet it is significantly more economical than producing traditional game or film content. With improvements in model efficiency, Odyssey anticipates these costs will decrease further.

The Future of Storytelling: Interactive Video

Historically, new technologies have birthed innovative forms of storytelling—from paintings on cave walls to the advent of books, film, and video games. Odyssey posits that AI-generated interactive video is the next evolutionary step in this long line of advancements. If successful, this technology could revolutionize not just entertainment, but also education, advertising, and more.

Imagine a training video where users can actively practice skills in real-time or virtual travel experiences that allow exploration from the comfort of home. The current research preview, while still in its infancy, serves as a tantalizing glimpse into a future where interactive AI-generated worlds become our new playgrounds.

Embrace the Future of Interactive Storytelling.

Conclusion: A New Era of Engagement

The implications of Odyssey’s AI model are profound. As we stand on the brink of what could be a transformative shift in how we engage with video, the excitement surrounding this technology is palpable. Odyssey invites users to experience this preview, marking just the beginning of what promises to be an incredible journey into the realms of interactive experiences.