Google Bard Gaining Ground, MidJourney 6 Alpha Web, ElevenLabs New Funding

January 30th 2024

AI Powered Tools

A run down of the latest SaaS products and services which leverage AI to you take back some time. Today is all about you content generators out there.

  • Upmetrics AI Assistant - for managers or business leaders looking to automate part of their workflow

  • Jasper AI - for content teams looking for an AI writing tool designed to aid content creation for bloggers, marketers, and businesses

  • Grammarly AI - for anyone writing content who already uses the Grammarly service

  • Writer - cross team service that aids the writing process through generative AI.

News

Midjourney 6 Alpha Web Browser Version Launch

Midjourney has released an alpha version of its platform for web browsers, moving beyond its previous Discord-only model. This new version is initially accessible to the "5,000 Club", users who have created over 5,000 images on the platform. The web interface features a user-friendly design with a dark mode and organizational tools for managing image collections. It offers enhanced control over the image creation process with customizable settings and introduces sliders for adjusting stylization, weirdness, and variety. The platform also includes an explore section for community interaction and advanced editing features like creating variations, upscaling images, and detailed pan and zoom functions. This expansion aims to attract a broader range of digital artists and creators​​.

ElevenLabs' Series B Funding and New AI Voice Products

ElevenLabs, an AI voice startup, has achieved a $1.1 billion valuation following an $80 million Series B funding round. Co-founded by ex-Google and Palantir employees, the company specializes in voice cloning and synthesis using machine learning. They plan to use the capital for advancing research and expanding product offerings. New features include a movie dubbing tool and a marketplace for selling cloned voices. Additionally, ElevenLabs has developed multilingual voice synthesis tools and an AI Dubbing product for translating audio and video content. The company's rapid growth includes over a million users and partnerships with significant content publishers.

Google Bard's Rise in AI Chatbot Rankings

Google's AI chatbot, Bard, powered by the updated Gemini Pro model, has recently ascended to second place in the LMSys Chatbot Arena, trailing behind OpenAI's GPT-4 Turbo. Bard's performance, particularly after integrating the new Gemini Pro-scale model, showcases significant advancements since its initial release. The Arena, which benchmarks various AI models including proprietary and open-source, utilizes human judgment to rank performance. Bard's success signifies Google's growing prominence in AI voice and chatbot technology, potentially challenging OpenAI's dominance.

Research

But a drop in the ocean of research papers coming out on AI these days. Here are two papers that were interesting and help show the advances that continue to be made in the field.

Lumiere: Revolutionizing Video Generation

Lumiere, a groundbreaking text-to-video diffusion model, is transforming digital storytelling with its unique Space-Time U-Net (STUNet) architecture. This enables the generation of entire videos in one pass, diverging from traditional keyframe-based methods. Lumiere excels in creating lifelike, diverse, and seamlessly flowing video content. While currently facing challenges with multi-scene videos and high-resolution outputs, Lumiere sets a new standard in video synthesis, promising future advancements in complex video generation tasks. It symbolizes a significant leap in digital media creation, blurring the lines between reality and rendered content.

Read the full article on Bright Journey AI.

Exploring the Frontiers of AI with Ferret

The article on Bright Journey AI focuses on "Ferret," a groundbreaking Multimodal Large Language Model (MLLM). Ferret excels in understanding and integrating images and text, featuring a unique hybrid region representation for precise spatial understanding. It shows exceptional performance in accurately describing image details and relationships, and in reducing object hallucinations. Developed by Apple researchers, Ferret was trained on the GRIT Dataset and tested using the Ferret-Bench benchmark system. Future research aims to improve its spatial understanding and handle ambiguities, while addressing challenges like dataset dependency and computational resource needs.

Read more at Bright Journey AI.

Reply

or to participate.