The Race for AI Video begins...


Hello Reader!

Welcome to Visually AI!

The Battle for Top Spot

Remember a few weeks ago, when we were all blown away by Kling AI?

While we were still waiting for Sora?

Yes, well... we've had some new competitors emerge, and it isn't clear who is going to win this race.

We've got some new faces, plus some familiar ones...

Luma AI Dream Machine

I had a DM from Luma AI.

And I had to keep shush until their announcement.

Introducing: The Dream Machine

Dream Machine is a next-generation AI video model that creates high-quality, realistic shots from text instructions and images.

Available to everyone, it generates 5-second videos with seamless extensions in 5-second increments. The free plan allows up to 30 videos per month, with paid options for more. You can try it here.

Blessed with early access, I couldn't wait to show others some of my creations.

These are all clips I generated using text-only prompts or my Midjourney images plus text prompts.

No editing and no upscaling:

You can see more examples in this YouTube video I posted this week.


RunwayML Gen-3

Now, you remember Runway, right?

Runway started off this craze, and they weren't going to be overshadowed.

Runway introduced Gen-3 Alpha, a new model for high-fidelity, controllable video generation.

This model represents a significant improvement over its predecessor, Gen-2, in terms of fidelity, consistency, and motion. Gen-3 Alpha is trained on a new infrastructure designed for large-scale multimodal training, incorporating both videos and images.

It supports various tools such as Text to Video, Image to Video, and Text to Image, along with advanced control modes like Motion Brush, Advanced Camera Controls, and Director Mode.

Key features of Gen-3 Alpha include:

  • Fine-grained temporal control: Enables imaginative transitions and precise key-framing.
  • Photorealistic humans: Capable of generating expressive human characters with diverse actions and emotions.
  • Industry customization: Allows for stylistically controlled and consistent characters tailored to specific artistic and narrative needs.

The model also includes new safeguards, such as an improved visual moderation system and C2PA provenance standards.

Gen-3 Alpha was developed collaboratively by a team of research scientists, engineers, and artists, to interpret a wide range of styles and cinematic terminology.

Unfortunately, Gen-3 is not available to the public yet, and they haven’t announced a release date.

As soon as I hear anything, I will let you know.

Two examples are below:


Hedra Character-1

Another new contender in the mix.

I introduce to you Character-1, Hedra's new foundation model for expressive talking, singing, and acting characters.

Available now on desktop and mobile, it offers a free open preview with videos up to 30 seconds long, and it can generate up to 90 seconds of content per 60 seconds of processing.

The company says this marks the first step in its mission to create a multimodal studio for complete control over emotional dialogue, movement, and entire worlds.

I was impressed with the expressive characters and the ability to maintain the quality throughout the videos. Try the beta for free here.

These are a few of my results:


🌎 AI Developments All Over The Globe

Microsoft Drops Florence-2:

Microsoft introduced Florence-2, a new unified AI model designed to tackle various vision tasks such as image classification, object detection, and segmentation. This model represents a significant advancement in Microsoft’s AI capabilities, promising improved performance and versatility in visual data processing.

Read more

OpenAI’s Former Chief Scientist Starts New AI Company:

The former chief scientist of OpenAI, Ilya Sutskever, has founded a new AI company, Safe Superintelligence Inc. (SSI). This move is expected to bring innovative approaches and advancements in AI technology, leveraging his extensive experience and expertise in the field.

Read more

Nvidia Becomes the Most Valuable Company in the World:

Nvidia has surpassed other leading tech companies to become the world’s most valuable company. This milestone underscores Nvidia’s critical role in AI technology and its growing influence in the tech industry, driven by its powerful GPUs and AI solutions.

Read more

Feds and Private Sector Run First AI Cyberattack Simulation:

In a pioneering effort, federal agencies and private sector companies conducted the first simulation of an AI-driven cyberattack. This exercise aimed to better understand and prepare for potential AI-enabled cyber threats, enhancing cybersecurity measures and response strategies.

Read more


You could have your AI service, tool, or event seen by Visually AI’s community of 15,000+ newsletter subscribers and 36,000+ followers on 𝕏:


💻 This Week On Visually AI YouTube

Luma AI Dream Machine | Free AI Video

📱 This Week On 𝕏

Heather Cooper (@HBCoop_) • Jun 16, 2024 • 19 Retweets • 122 Likes

Heather Cooper (@HBCoop_) • Jun 15, 2024 • 20 Retweets • 202 Likes

Heather Cooper (@HBCoop_) • Jun 14, 2024 • 9 Retweets • 72 Likes

🚀 This Week’s AI Tools Picks!

Video to Sound Effects: ElevenLabs' new tool analyzes your video to generate four matching sound effects and lets you download each selection with the sound attached to the video. (link)

The generated sounds are often accurate, as you can see in this video with one of my Krea Video clips:

OnePublish: Cross-publish your content directly from Notion to DEV, Hashnode, Medium, Ghost and more upcoming platforms. (link)

Unicorn Platform: AI-powered website builder designed for startups, solo entrepreneurs, and hackers to create responsive websites effortlessly using ready-made templates. (link)

Glif: A low-code platform for creating AI-powered "glifs" that transform user inputs like text and images into outputs such as text, images, or videos. (link)

Abacus AI: Comprehensive AI platform offering solutions for forecasting, anomaly detection, language processing, personalization, marketing, sales, vision, and fraud detection. (link)

OSSA: Converts your script into professionally edited short-form video, with no editing skills required. (link)


🖼️ Image Prompts

PROMPT: cinematic still, Allied soldiers during the D-Day invasion at Normandy, Soldiers reading letters from home during a brief respite, with expressions of longing and resolve, surrounded by the remnants of battle. Shot from a close angle, emphasizing the personal sacrifices and connections.

PROMPT: A traditional pagoda surrounded by a lake filled with floating lanterns, reflecting the structure's ornate details. The lanterns' warm glow contrasts with the cool tones of the water, creating a peaceful and enchanting scene.


Thank you for reading :)

Before you go, I have a quick question for you!

Would you be interested in reading a book on the "Basics of AI Image Generation"?

You can vote below:

Visually AI

Your weekly dose of AI news, tools, and innovation with a visual twist. Breaking down barriers to make AI content creation accessible to all. Expressing my own personal takes and educating readers in the world of Artificial Intelligence.
