Hello Reader!

Welcome to Visually AI!

The Battle for Top Spot

Remember a few weeks ago, when we were all blown away by Kling AI?

While we were still waiting for Sora?

Yes, well... we've had some new competitors emerge, and it isn't clear who is going to win this race.

We've got some new faces, plus some familiar ones...

Luma AI Dream Machine

Then I got a DM from Luma AI.

And I had to keep quiet until their announcement.

Introducing: The Dream Machine

Dream Machine is a next-generation AI video model that creates high-quality, realistic shots from text instructions and images.

Available to everyone, it generates 5-second videos with seamless extensions in 5-second increments. The free plan allows up to 30 videos per month, with paid options for more. You can try it here.

Blessed with early access, I couldn't wait to show others some of my creations.

These are all clips I generated using text-only prompts or my Midjourney images plus text prompts.

No editing or upscaling:

You can see more examples in this YouTube video I posted this week.

RunwayML Gen-3

Now, you remember Runway, right?

Runway started off this craze, and they weren't going to be overshadowed.

Runway introduced Gen-3 Alpha, a new model for high-fidelity, controllable video generation.

This model represents a significant improvement over its predecessor, Gen-2, in terms of fidelity, consistency, and motion. Gen-3 Alpha is trained on a new infrastructure designed for large-scale multimodal training, incorporating both videos and images.

It supports various tools such as Text to Video, Image to Video, and Text to Image, along with advanced control modes like Motion Brush, Advanced Camera Controls, and Director Mode.

Key features of Gen-3 Alpha include:

Fine-grained temporal control: Enables imaginative transitions and precise key-framing.
Photorealistic humans: Capable of generating expressive human characters with diverse actions and emotions.
Industry customization: Allows for stylistically controlled and consistent characters tailored to specific artistic and narrative needs.

The model also includes new safeguards, such as an improved visual moderation system and C2PA provenance standards.

Gen-3 Alpha was developed collaboratively by a team of research scientists, engineers, and artists, and it is designed to interpret a wide range of styles and cinematic terminology.

Unfortunately, Gen-3 Alpha is not available to the public yet, and Runway hasn’t announced a release date.

As soon as I hear anything, I will let you know.

Two examples are below:

Hedra Character-1

Another new contender in the mix.

I introduce to you Character-1, Hedra's new foundation model for expressive talking, singing, and acting characters.

Available now on desktop and mobile, it offers a free open preview with clips of up to 30 seconds, and it generates up to 90 seconds of content per 60 seconds.

The company says this marks the first step in its mission to create a multimodal studio for complete control over emotional dialogue, movement, and entire worlds.

I was impressed with the expressive characters and the ability to maintain the quality throughout the videos. Try the beta for free here.

These are a few of my results:

🌎 AI Developments All Over The Globe

Microsoft Drops Florence-2:

Microsoft introduced Florence-2, a new unified AI model designed to tackle various vision tasks such as image classification, object detection, and segmentation. This model represents a significant advancement in Microsoft’s AI capabilities, promising improved performance and versatility in visual data processing.

OpenAI’s Former Chief Scientist Starts New AI Company:

Ilya Sutskever, the former chief scientist of OpenAI, has founded a new AI company, Safe Superintelligence Inc. (SSI). The move is expected to bring new approaches to AI development, drawing on his extensive experience in the field.

Nvidia Becomes the Most Valuable Company in the World:

Nvidia has surpassed other leading tech companies to become the world’s most valuable company. This milestone underscores Nvidia’s critical role in AI technology and its growing influence in the tech industry, driven by its powerful GPUs and AI solutions.

Feds and Private Sector Run First AI Cyberattack Simulation:

In a pioneering effort, federal agencies and private sector companies conducted the first simulation of an AI-driven cyberattack. This exercise aimed to better understand and prepare for potential AI-enabled cyber threats, enhancing cybersecurity measures and response strategies.

You could have your AI service, tool, or event seen by Visually AI’s community of 15,000+ newsletter subscribers or 36,000+ followers on 𝕏:

ADVERTISE WITH ME

💻 This Week On Visually AI YouTube

Luma AI Dream Machine | Free AI Video

📱 This Week On 𝕏

Heather Cooper

@HBCoop_

I got early access to @hedra_labs Character-1 and it's incredible.
The model generates video and dynamic 3D content with a focus on expressive humans.
Workflow and examples below:
#hedra

5:20 PM • Jun 18, 2024

Heather Cooper

@HBCoop_

Dream Machine by @LumaLabsAI: Strengths
Dream Machine excels in generating clear, realistic motion from image + text prompts.
3 Examples from my Midjourney images:
#LumaDreamMachine

4:24 PM • Jun 13, 2024

Heather Cooper

@HBCoop_

Sora who?
Generate your own videos from text or your favorite images right now with @LumaLabsAI Dream Machine!
10 of my favorite examples with prompts:
#LumaDreamMachine

8:11 PM • Jun 12, 2024

Heather Cooper

@HBCoop_

⚡Verdant Ruins Quest⚡
Generated with @LumaLabsAI Dream Machine
Workflow below:

11:03 PM • Jun 16, 2024

Heather Cooper

@HBCoop_

Krea Video Enhancer is now available for everyone.
Upscale videos up to 2.5x and 120 fps.
Details below:
(Upscaled 2.5x)

4:04 PM • Jun 16, 2024

Heather Cooper

@HBCoop_

Using Dream Machine to animate WWII images I generated in Midjourney:
@LumaLabsAI

6:19 PM • Jun 15, 2024

Heather Cooper

@HBCoop_

Dream Machine by @LumaLabsAI
Collab w/ @itsthomashaynes

4:06 PM • Jun 14, 2024

🚀 This Week’s AI Tools Picks!

Video to Sound Effects: ElevenLabs' new tool analyzes your video to generate four matching sound effects and lets you download each selection with the sound attached to the video. (link)

The generated sounds are often accurate, as you can see in this video with one of my Krea Video clips:

OnePublish: Cross-publish your content directly from Notion to DEV, Hashnode, Medium, Ghost and more upcoming platforms. (link)

Unicorn Platform: AI-powered website builder designed for startups, solo entrepreneurs, and hackers to create responsive websites effortlessly using ready-made templates. (link)

Glif: A low-code platform for creating AI-powered "glifs" that transform user inputs like text and images into outputs such as text, images, or videos. (link)

Abacus AI: Comprehensive AI platform offering solutions for forecasting, anomaly detection, language processing, personalization, marketing, sales, vision, and fraud detection. (link)

OSSA: Converts your script into professionally edited short-form video, with no editing skills required. (link)

🖼️ Image Prompts

PROMPT: cinematic still, Allied soldiers during the D-Day invasion at Normandy, Soldiers reading letters from home during a brief respite, with expressions of longing and resolve, surrounded by the remnants of battle. Shot from a close angle, emphasizing the personal sacrifices and connections.

PROMPT: A traditional pagoda surrounded by a lake filled with floating lanterns, reflecting the structure's ornate details. The lanterns' warm glow contrasts with the cool tones of the water, creating a peaceful and enchanting scene.