AI in entertainment

"Last Week in AI" including OpenAI, Stack Overflow, Apple's new Photos app, YouTube Premium, Microsoft MAI-1, Eli Lilly, Audible, Apple's M4 chip, Google's Pixel 8a, machine learning in whale communication, and more.

Last Week in AI: Episode 31

Hey everyone, welcome to this week’s edition of “Last Week in AI.” This week’s stories provide a glimpse into how AI is reshaping industries and our daily lives. Let’s dive in and explore these fascinating developments together.

OpenAI and Stack Overflow Partnership

Partnership Announcement: OpenAI and Stack Overflow have formed a new API partnership to leverage their collective strengths—Stack Overflow’s technical knowledge platform and OpenAI’s language models.

Impact and Controversy: This partnership aims to empower developers by combining high-quality technical content with advanced AI models. However, some Stack Overflow users have protested, arguing it exploits their contributed labor without consent, leading to bans and post reverts by staff. This raises questions about content creator attribution and future model training, despite the potential for improved AI models. Read more

Apple’s New Photos App Feature

Feature Introduction: Apple is set to introduce a “Clean Up” feature in its Photos app update, leveraging generative AI for advanced image editing. This tool will allow users to remove objects from photos using a brush tool, similar to Adobe’s Content-Aware Fill.

Preview and Positioning: Currently in testing on macOS 15, Apple may preview this feature during the “Let Loose” iPad event on March 18, 2023. This positions the new iPads as AI-equipped devices, showcasing practical AI applications beyond chatbots and entertainment. Read more

YouTube Premium’s AI “Jump Ahead” Feature

Feature Testing: YouTube Premium subscribers can now test an AI-powered “Jump ahead” feature, allowing them to skip commonly skipped video sections. By double-tapping to skip, users can jump to the point where most viewers typically resume watching.

Availability and Aim: This feature is currently available on the YouTube Android app in the US for English videos and requires a Premium subscription. It complements YouTube’s “Ask” feature and aims to enhance the viewing experience by leveraging AI and user data. Read more

Microsoft’s MAI-1 Language Model Development

Model Development: Microsoft is developing a new large-scale AI language model, MAI-1, led by Mustafa Suleyman, the former CEO of Inflection AI. MAI-1 will have approximately 500 billion parameters, significantly larger than Microsoft’s previous models.

Strategic Significance: This development signifies Microsoft’s dual approach to AI, focusing on both small and large models. Despite its investment in OpenAI, Microsoft is independently advancing its AI capabilities, with plans to unveil MAI-1 at their Build conference. Read more

AI in Drug Discovery at Eli Lilly

Innovative Discovery: The pharmaceutical industry is integrating AI into drug discovery, with Eli Lilly scientists noting innovative molecular designs generated by AI. This marks a precedent in AI-driven biology breakthroughs.

Industry Impact: AI is expected to propose new drugs and generate designs beyond human capability. This integration promises faster development times, higher success rates, and exploration of new targets, reshaping drug discovery. Read more

AI-Narrated Audiobooks on Audible

Audiobook Trends: Over 40,000 AI-voiced titles have been added to Audible since Amazon launched a tool for self-published authors to generate AI narrations. This makes audiobook creation more accessible but has sparked controversy.

Industry Reaction: Some listeners dislike the lack of filters to exclude AI narrations, and human narrators fear job losses. Major publishers are embracing AI for cost savings, highlighting tensions between creative integrity and commercial incentives. Read more

Apple’s M4 Chip for iPad Pro

Processor Introduction: Apple’s M4 chip, the latest and most powerful processor for the new iPad Pro, offers groundbreaking performance and efficiency.

Key Innovations: The M4 chip features a 10-core CPU, 10-core GPU, advanced AI capabilities, and power efficiency gains. These innovations enable superior graphics, real-time AI features, and all-day battery life. Read more

Google’s Pixel 8a Smartphone

Affordable Innovation: The Pixel 8a, Google’s latest affordable smartphone, is priced at $499 and packed with AI-powered features and impressive camera capabilities.

Key Highlights: The Pixel 8a features a refined design, dual rear camera, AI tools, and enhanced security. It also offers family-friendly features and 7 years of software support. Read more

OpenAI’s Media Manager Tool

Tool Development: OpenAI is building a Media Manager tool to help creators manage how their works are included in AI training data. This system aims to identify copyrighted material across sources.

AI Training Approach: OpenAI uses diverse public datasets and proprietary data to train its models, collaborating with creators, publishers, and regulators to support healthy ecosystems and respect intellectual property. Read more

Machine Learning in Sperm Whale Communication

Breakthrough Discovery: MIT CSAIL and Project CETI researchers have discovered a combinatorial coding system in sperm whale vocalizations, akin to a phonetic alphabet, using machine learning techniques.

Communication Insights: By analyzing a large dataset of whale codas, researchers identified patterns and structures, suggesting a complex communication system previously thought unique to humans. This finding opens new avenues for studying cetacean communication. Read more

Sam Altman’s Concerns About AI’s Economic Impact

CEO’s Warning: Sam Altman, CEO of OpenAI, has expressed significant concerns about AI’s potential impact on the labor market and economy, particularly job disruptions and economic changes.

Economic Threat: Studies suggest AI could automate up to 60% of jobs in advanced economies, leading to job losses and lower wages. Altman emphasizes the need to address these concerns proactively. Read more

AI Lecturers at Hong Kong University

Educational Innovation: HKUST is testing AI-generated virtual lecturers, including an AI version of Albert Einstein, to transform teaching methods and engage students.

Teaching Enhancement: AI lecturers aim to address teacher shortages and enhance learning experiences. While students find them approachable, some prefer human teachers for unique experiences. Read more

OpenAI’s NSFW Content Proposal

Content Policy Debate: OpenAI is considering allowing users to generate NSFW content, including erotica and explicit images, using its AI tools like ChatGPT and DALL-E. This proposal has sparked controversy.

Ethical Concerns: Critics argue it contradicts OpenAI’s mission of developing “safe and beneficial” AI. OpenAI acknowledges potential valid use cases but emphasizes responsible generation within appropriate contexts. Read more

Bumble’s Vision for AI in Dating

Future of Dating: Bumble founder Whitney Wolfe Herd envisions AI “dating concierges” streamlining the matching process by essentially going on dates to find compatible matches for users.

AI Assistance: These AI assistants could also provide dating coaching and advice. Despite concerns about AI companions forming unhealthy bonds, Bumble’s focus remains on fostering healthy relationships. Read more

Final Thoughts

This week’s updates showcase AI’s transformative power in areas like education, healthcare, and digital content creation. However, they also raise critical questions about ethics, job displacement, and intellectual property. As we look to the future, it’s essential to balance innovation with responsibility, ensuring AI advancements benefit society as a whole. Thanks for joining us, and stay tuned for more insights and updates in next week’s edition of “Last Week in AI.”

Last Week in AI: Episode 31 Read More »

Summary of weekly AI news featuring Google Cloud's achievements, legislative updates, and technological innovations across the industry.

Last Week in AI: Episode 27

Welcome to another edition of Last Week in AI. From groundbreaking updates in AI capabilities at Google Cloud to new legislative proposals aimed at transparency in AI model training, the field is buzzing with activity. Let’s dive in!

Google Cloud AI Hits $36 Billion Revenue Milestone

Google Cloud has announced significant updates to its AI capabilities at the Google Cloud Next 2024 event, amidst reaching a $36 billion annual revenue run rate, a substantial increase from five years prior.

Key Takeaways:

  • Impressive Growth: Google Cloud’s revenue has quintupled over the past five years, largely driven by its deep investments in AI.
  • Gemini 1.5 Pro Launch: The new AI model, now in public preview, offers enhanced performance and superior long-context understanding.
  • Expanded Model Access: Google has broadened access to its Gemma model on the Vertex AI platform, aiding in code generation and assistance.
  • Vertex AI Enhancements: The platform now supports model augmentation using Google Search and enterprise data.
  • TPU v5p AI Accelerator: The latest in Google’s TPU series offers four times the compute power of its predecessor.
  • AI-Driven Workspace Tools: New Gemini-powered features in Google Workspace assist with writing, video creation, and security.
  • Client Innovation: Key clients like Mercedes-Benz and Uber are leveraging Google’s generative AI for diverse applications, from customer service to bolstering cybersecurity.

Why It Matters

With its expanding suite of AI tools and powerful new hardware, Google Cloud is poised to lead the next wave of enterprise AI applications.


New U.S. Bill Targets AI Copyright Transparency

A proposed U.S. law aims to enhance transparency in how AI companies use copyrighted content to train their models.

Key Takeaways:

  • Bill Overview: The “Generative AI Copyright Disclosure Act” requires AI firms to report their use of copyrighted materials to the Copyright Office 30 days before launching new AI systems.
  • Focus on Legal Use: The bill mandates disclosure to address potential illegal usage in AI training datasets.
  • Support from the Arts: Entertainment industry groups and unions back the bill, stressing the protection of human-created content utilized in AI outputs.
  • Debate on Fair Use: Companies like OpenAI defend their practices under fair use. This could reshape copyright law and affect both artists and AI developers.

Why It Matters

This legislation could greatly impact generative AI development, ensuring artists’ rights and potentially reshaping AI companies’ operational frameworks.


Meta Set to Launch Llama 3 AI Model Next Month

Meta is gearing up to release Llama 3, a more advanced version of its large language model. Aiming for greater accuracy and broader topical coverage.

Key Takeaways:

  • Advanced Capabilities: Llama 3 will feature around 140 billion parameters, doubling the capacity of Llama 2.
  • Open-Source Strategy: Meta is making Llama models open-source to attract more developers.
  • Careful Progress: While advancing in text-based AI, Meta remains cautious with other AI tools like the unreleased image generator Emu.
  • Future AI Directions: Despite Meta’s upcoming launch, Chief AI Scientist Yann LeCun envisions AI’s future in different technologies like Joint Embedding Predicting Architecture (JEPA).

Why It Matters

Meta’s Llama 3 launch shows its drive to stay competitive in AI, challenging giants like OpenAI and exploring open-source models.


Adobe Buys Creator Videos to Train its Text-to-Video AI Model

Adobe is purchasing video content from creators to train its text-to-video AI model, aiming to compete in the fast-evolving AI video generation market.

Key Takeaways:

  • Acquiring Content: Adobe is actively buying videos that capture everyday activities, paying creators $3-$7 per minute.
  • Legal Compliance: The company is ensuring that its AI training materials are legally and commercially safe, avoiding the use of scraped YouTube content.
  • AI Content Creation: Adobe’s move highlights the rapid growth of AI in creating diverse content types, including images, music, and now videos.
  • The Role of Creativity: Despite the accessibility of advanced AI tools, individual creativity remains crucial, as they become universally accessible.

Why It Matters

Adobe’s strategy highlights its commitment to AI advancement and stresses the importance of ethical development in the field.


MagicTime Innovates with Metamorphic Time-Lapse Video AI

MagicTime is pioneering a new AI model that creates dynamic time-lapse videos by learning from real-world physics.

Key Takeaways:

  • MagicAdapter Scheme: This technique separates spatial and temporal training. Thus, allowing the model to absorb more physical knowledge and enhance pre-trained time-to-video (T2V) models .
  • Dynamic Frames Extraction: Adapts to the broad variations found in metamorphic time-lapse videos, effectively capturing dramatic transformations.
  • Magic Text-Encoder: Enhances the AI’s ability to comprehend and respond to textual prompts for metamorphic videos.
  • ChronoMagic Dataset: A specially curated time-lapse video-text dataset, designed to advance the AI’s capability in generating metamorphic videos.

Why It Matters

MagicTime’s advanced approach in generating time-lapse videos that accurately reflect physical changes showcases significant progress towards developing AI that can simulate real-world physics in videos.


OpenAI Trained GPT-4 Using Over a Million Hours of YouTube Videos

Major AI companies like OpenAI and Meta are encountering hurdles in sourcing high-quality data for training their advanced models, pushing them to explore controversial methods.

Key Takeaways:

  • Copyright Challenges: OpenAI has used over a million hours of YouTube videos for training GPT-4, potentially breaching YouTube’s terms of service.
  • Google’s Strategy: Google claims its data collection complies with agreements made with YouTube creators, unlike its competitors.
  • Meta’s Approach: Meta has also been implicated in using copyrighted texts without permissions, trying to keep pace with rivals.
  • Ethical Concerns: These practices raise questions about the limits of fair use and copyright law in AI development.
  • Content Dilemma: There’s concern that AI’s demand for data may soon outstrip the creation of new content.

Why It Matters

The drive for comprehensive training data is leading some of the biggest names in AI into ethically and legally ambiguous territories, highlighting a critical challenge in AI development: balancing innovation with respect for intellectual property rights.


Elon Musk Predicts AI to Surpass Human Intelligence by Next Year

Elon Musk predicts that artificial general intelligence (AGI) could surpass human intelligence as early as next year, reflecting rapid AI advancements.

Key Takeaways:

  • AGI Development Timeline: Musk estimates that AGI, smarter than the smartest human, could be achieved as soon as next year or by 2026
  • Challenges in AI Development: Current limitations include a shortage of advanced chips, impacting the training of Grok’s newer models.
  • Future Requirements: The upcoming Grok 3 model will need an estimated 100,000 Nvidia H100 GPUs.
  • Energy Constraints: Beyond hardware, Musk emphasized that electricity availability will become a critical factor for AI development in the near future.

Why It Matters

Elon Musk’s predictions emphasize the fast pace of AI technology and highlight infrastructural challenges that could shape future AI capabilities and deployment.


Udio, an AI-Powered Music Creation App

Udio, developed by ex-Google DeepMind researchers, allows anyone to create professional-quality music.

Key Takeaways:

  • User-Friendly Creation: Udio enables users to generate fully mastered music tracks in seconds with a prompt.
  • Innovative Features: It offers editing tools and a “vary” feature to fine-tune the music, enhancing user control over the final product.
  • Copyright Safeguards: Udio includes automated filters to ensure that all music produced is original and copyright-compliant.
  • Industry Impact: Backed by investors like Andreessen Horowitz, Udio aims to democratize music production, potentially providing new artists with affordable means to produce music.

Why It Matters

Udio could reshape the music industry landscape by empowering more creators with accessible, high-quality music production tools.


Final Thoughts

As we wrap up this week’s insights into the AI world, it’s clear that the pace of innovation is not slowing down. These developments show the rapid progress in AI technology. Let’s stay tuned to see how these initiatives unfold and impact the future of AI.

Last Week in AI: Episode 27 Read More »

Overview of the latest advancements and discussions in AI technology, including Grok 1.5, Stable Diffusion 3, Google Gemini's controversy, Reddit's AI integration, Tyler Perry's production pause due to AI, Nvidia's new gaming app, Air Canada's chatbot lawsuit, and Adobe Acrobat's AI assistant.

Last Week in AI: Episode 20

Welcome to this week’s edition of “Last Week in AI.” As we navigate the evolving landscape of artificial intelligence, it’s crucial to stay informed about the latest breakthroughs, debates, and applications. From groundbreaking innovations to ethical dilemmas, this edition covers the pivotal moments in AI that are shaping our future.

X + Midjourney = Partnership?

Elon’s floating the idea of linking X with Midjourney, to spice up how we make content on the platform. This move is all about giving users a new tool to play with, enhancing creativity rather than confusing it. Here’s the takeaway:

  1. AI as a Creative Partner: Musk’s vision is to integrate AI into X, offering a fresh way to craft content. It’s about giving your posts an extra edge with AI’s creative input.
  2. Serious Talks Happening: In a recent chat on X, Musk seemed really into the idea of partnering with Midjourney. It’s not all talk; they’re actively exploring how to bring this feature to life.
  3. Looking Beyond Social Media: Musk has bigger plans than just tweets and likes. He’s thinking about transforming X into a hub for more than just socializing—think shopping, watching stuff, all with AI’s help.

Why You Should Care

Musk’s hint at an AI collab for X is about boosting our creative options, not blending them into a puzzle. If they pull this off, X could set a new trend in how we use social media, making it a go-to for innovative, AI-assisted content creation.


Grok 1.5 Update

Elon Musk dropped another update about Grok 1.5, the latest version of the xAI language model, and it’s got a cool new trick up its sleeve called “Grok Analysis.” It can quickly sum up all the chatter in threads and replies, making sense of the maze so you can get straight to the point or craft your next killer post. Here’s the takeaway:

  • Grok Analysis is the Star: Ever wish you could instantly get the gist on a whole conversation without scrolling for ages? That’s what Grok Analysis is here for.
  • It’s Not Just About Summaries: Musk’s not stopping there. He’s teasing that Grok is going to get even better at reasoning, coding, and doing a bunch of things at once. If Grok 1.5 lives up to the hype, we’re all in for a treat.
  • Coming Soon: The wait won’t be long. Grok 1.5 is expected to drop in the next few weeks, and it’s set to shake things up. If you’re into getting information faster and creating content more easily, keep your eyes peeled.

Why You Should Care

Grok 1.5 is just warming up. With Musk behind it, promising to cut through online noise and beef up our AI toolkit, it’s hard not to get excited.


Stable Diffusion 3 Update

Stable Diffusion 3 is still baking, but the early looks are turning heads. We’re seeing hints of crisper visuals, a smarter grasp on language, and a knack for handling complex requests like a pro. Here’s the takeaway:

  • Exclusive Preview: It’s not out for everyone just yet. There’s a line to get in as they’re still tweaking and taking notes to make sure it’s top-notch at launch.
  • Tech Upgrade: They’ve pumped up the tech from 800 million to a staggering 8 billion parameters. This beast can scale to fit your needs, powered by cutting-edge AI architecture and techniques.
  • Safety First: They’re dead serious about keeping things clean and creative, with checks every step of the way. The aim is to let creativity bloom without stepping over the line.

Why You Should Care

Whether you’re dabbling for kicks or diving in for professional projects, they’re setting the stage for you. And while we all wait for the grand entrance, there’s still plenty to explore with Stability AI’s current offerings.


Google’s Gemini Under Fire

Google’s AI chatbot, Gemini, has landed in hot water due to tipping the scales against white people by often generating images of non-white individuals. Gemini’s staunch refusal to create images based on race has sparked a debate over AI bias and the quest for inclusivity. Here’s the takeaway:

The Pope according to Google's Gemini
credit: X @endwokeness
  • Core Issue: This isn’t just about pictures. It’s a big red flag waving at Google, questioning their duty to craft AI that’s fair and unbiased. The stir over bias is pushing Google to prove their tech mirrors real-world fairness and diversity.
  • The “Go Woke, Go Broke” Debate: “Go woke, go broke,” Google’s push for political correctness might backfire. It’s a tightrope walk between tackling social matters and tech innovation.
  • Leadership Under the Microscope: The heat’s turning up on Google’s execs. There’s chatter that to win back trust, maybe it’s time for some new faces at the helm, hinting that a shake-up could be on the cards.
  • Zooming Out: This whole Gemini drama is just a piece of a larger puzzle. As AI tech grows, the challenge is to make sure it grows right, steering clear of deepening societal divides.

Why You Should Care

Google’s facing the tough task of navigating through the storm with integrity and a commitment to reflecting history accurately. It’s a moment for Google to step up and show it can lead the way in developing AI that truly understands and represents us all.


Reddit AI

Reddit’s striking a deal to feed its endless stream of chats and memes into the AI brain-trust. Why? They’re eyeing a flashy $5 billion IPO and showing off their AI muscle could sweeten the deal. But here’s the twist: not everyone on Reddit is throwing a party about it. Here’s the takeaway:

  • AI’s New Playground: Your late-night Reddit rabbit holes? They could soon help teach AI how to mimic human banter. Pretty wild, right?
  • Big Money Moves: Reddit’s not just flirting with AI for kicks. They’re doing it with big dollar signs in their eyes, thinking it might help them hit it big when they go public.
  • Users Are Wary: Remember when Reddit tried to charge for API access and everyone lost their minds? Yeah, this AI thing is stirring the pot again. Users are side-eyeing the move, worried about privacy and what it means for their daily dose of memes and threads.
  • The Ethical Maze: It’s a bit of a head-scratcher. Using public gab for AI sounds cool but wades into murky waters about privacy and who really owns your online rants.

Why You Should Care

Reddit’s AI gamble is bold, maybe brilliant, but it’s also kicking up a dust storm of debates. As they prep for the big leagues with an IPO, balancing tech innovation with keeping their massive community chill is the game. Let’s watch how this unfolds.


Tyler Perry Halts $800M Production Due to AI

Tyler Perry just hit the brakes on a massive $800 million studio expansion, and guess what? AI’s the reason. After getting a peek at what OpenAI’s Sora can do—think making video clips just from text—Perry’s having a major rethink. Why pour all that cash into more soundstages when AI might just let you whip up scenes without needing all that physical space? Here’s the takeaway:

  • AI Changes the Game: Perry saw Sora in action and it blew his mind. This tool isn’t just cool; it’s a potential game-changer for how movies are made, making the whole “need a big studio” idea kind of outdated.
  • Hold Up on Expansion: So, those plans for bulking up his studio with new soundstages? On ice, indefinitely. Perry’s decision is a big nod to how fast AI’s moving and shaking things up in filmmaking.
  • Thinking About the Crew: It’s not all about tech and savings, though. Perry’s pausing to think about the folks behind the scenes—crew, builders, artists—and how this shift to digital could shake their world.

Why You Should Care

Tyler Perry’s move is a wake-up call: AI’s not just about chatbots and data crunching; it’s stepping onto the movie set, ready to direct. As we dive into this AI-powered future, Perry’s reminding us to keep it human, especially for those who’ve been building the sets, rigging the lights, and making the magic happen behind the camera.


Nvidia’s New App

Nvidia’s rolling out something cool for gamers: a new app that brings everything you need into one spot. Remember the hassle of flipping between the Control Panel and GeForce Experience just to mess with your settings or update your GPU? Nvidia’s new app, which is still in the beta phase, is here to end that headache. Here’s the takeaway:

  • All-in-One Convenience: This app has everything from driver updates to tweaking your graphics settings, including the good stuff like G-Sync, without making you jump through hoops.
  • Streamers, Rejoice: If you’re into streaming, there’s an in-game overlay that makes getting to your recording tools and checking out your performance stats a breeze.
  • AI Magic: For the GeForce RTX crowd, there are AI-powered filters to play with and even AI-optimized textures for sprucing up older games that weren’t originally designed with RTX in mind.
  • Visual Boost: Ever used Digital Vibrance in the Control Panel and thought it could be better? Meet RTX Dynamic Vibrance. It’s here to crank up your visual game to the next level.

Why You Should Care

Nvidia’s new app is all about making your gaming setup simpler and slicker, with a few extra perks thrown in for good measure. If you’re curious, the beta’s up for grabs on Nvidia’s website. Give it a whirl and see how it changes your gaming setup.


Air Canada Loses Court Case Over Chatbot

Air Canada lost a court case due to its chatbot’s mistake. Jake Moffatt sought info on mourning fare from the chatbot, which incorrectly promised a post-trip refund—contrary to Air Canada’s actual policy. After being denied the refund, Moffatt sued. Air Canada tried to pin the error on the chatbot, arguing it should be seen as a separate entity. The court disagreed, ruling the airline responsible for its chatbot’s misinformation, emphasizing that companies can’t dodge accountability for their chatbot’s errors. Here’s the takeaway:

  • Chatbot Confusion: A chatbot trying to help ended up causing a legal headache for Air Canada, showing that even AI can slip up.
  • Courtroom Drama: The court’s decision to hold Air Canada accountable for its chatbot’s mistake is a wake-up call. It’s like saying, “You put it out there, you own it,” which is pretty groundbreaking.
  • Ripple Effect: This case is a heads-up that they need to double-check what their digital helpers are saying.

Why You Should Care

This whole saga with Air Canada and its chatbot is more than just a quirky court case; it’s a landmark decision that puts companies on notice. If your chatbot messes up, it’s on you. It’s a reminder that in the digital age, keeping an eye on AI isn’t just smart—it’s necessary.


Adobe Acrobat AI Assistant

Adobe Acrobat’s new Generative AI feature is shaking things up, making your documents interactive. Need quick insights or help drafting an email? This AI Assistant’s got your back, answering questions with info pulled straight from your docs. And with the Generative Summary, you’re getting the cliff notes version without all the digging. Here’s the takeaway:

Credit: Adobe
  • AI Assistant: It’s helping you navigate documents and prep like a pro.
  • Quick Summaries: Skip the deep dive and get straight to the key points, saving you heaps of time.
  • Wide Access: Available to anyone with Acrobat Standard and Pro, including trial users. Starts with English, but more languages to come.

Why You Should Care

Adobe’s stepping into the future, transforming Acrobat from a simple PDF viewer to a smart, interactive tool that simplifies your work. It’s a glimpse into how tech is making our daily tasks easier and more efficient.


Wrapping Up

That wraps up another week of significant advancements and conversations in the world of AI. As we’ve seen, the realm of artificial intelligence continues to offer both promise and challenges, pushing us to rethink how we interact with technology. Stay tuned for more updates as we continue to explore the vast potential and navigate the complexities of AI together.

Last Week in AI: Episode 20 Read More »

Latest AI Developments: Google Gemini, Apple Vision Pro, and Neuralink's Human Trial

Last Week in AI: Episode 17

Welcome back to “Last Week in AI.” Although, it’s been a slow couple of weeks, we’ve got some pretty groundbreaking stuff to talk about. From Google shaking things up with Gemini, to Apple launching a ton of apps for its Vision Pro platform, making digital interaction more immersive than ever. And let’s not forget Neuralink successfully testing their brain-computer interface in a human for the first time. Let’s break it down for you.

Google

Google’s Bard can now make images using its Imagen 2 model. It’s Google’s answer to ChatGPT Plus. They made sure it’s responsible, with watermarks and no-go zones for certain content. Plus, they dropped ImageFX, a simple tool for making pictures from text. Bard now speaks over 40 languages worldwide.


Key Points:
  • Bard vs. ChatGPT Plus: Google’s stepping up, adding image-making to Bard.
  • Safety First: Watermarks and rules keep things in check.
  • Worldwide Reach: Bard’s now a global player, with a massive language boost.

With Bard and ImageFX, Google’s blending creativity, ethics, and accessibility. It’s smart, it’s global, and it’s responsible. That’s the future of AI they’re betting on.


Gemini

Google’s AI, Gemini, handles text, code, audio, images, and video, all in one. There are three versions: Ultra, Pro, and Nano, with Ultra being a real standout, even outsmarting human experts in understanding language and coding.

Key Points:
  1. Versatility: Gemini’s a jack-of-all-trades, mixing and matching different types of data seamlessly.
  2. Three Models: From Ultra’s heavyweight capabilities to Nano’s mobile-friendly design, there’s something for every need.
  3. Safety and Accessibility: Google’s not cutting corners on safety, checking Gemini for bias and toxicity. It’s getting baked into Google products and is available for developers through Google AI tools.

Google’s Gemini is built to be versatile, accessible, and safe. This is a way for AI to work with us, making life easier for developers and changing how we all interact with technology.


Shopify

Shopify’s now adding a media editor and conversational search to its toolkit with its AI-powered Magic suite. It can tweak photo backgrounds or switch them up entirely—no Photoshop skills needed. It even suggests backgrounds to match what you’ve got, making products shine without the fancy photo shoot. All these AI perks? They’re thrown in for free, knocking down hurdles for entrepreneurs.


Key Points:
  • DIY Photo Magic: Sellers can edit photos like pros, thanks to generative image fill.
  • Convo Search: This isn’t your old-school search; it gets what you’re looking for by understanding the intent, making results way more relevant.
  • Tools Galore: Beyond photos, Shopify’s got AI doing heavy lifting with product descriptions, chatbots, and smart replies, all aimed at easing merchant and buyer chats.

Shopify’s making sure small and big businesses alike get a fair shot. It’s about giving options, not orders. Shopify’s vision? A leveled playing field where all sellers get to shine.


Midjourney

Midjourney’s latest anime-style update, Niji V6 is here. It offers both amateurs and professionals new ways to blend text with imagery. This allows artists to embed words directly into their pictures but also introduces enhanced features for customizing art like never before.


Key Points:
  • Creative Fusion: Niji V6 lets you combine drawings with text, adding a personal touch to your art.
  • Enhanced Control: Features like ‘Vary (Region)’, ‘Pan’, and ‘Zoom’ give artists unprecedented control over their creations.
  • Accessibility: Available to paying users through the Midjourney chatbot, with a full release scheduled for February.

By empowering artists to fuse text and imagery seamlessly, it opens up new possibilities for storytelling and personal expression. Whether you’re just dabbling in digital art or you’re a seasoned pro, Niji V6 promises to inspire and transform the way anime-style art is made.


OpenAI

OpenAI’s ChatGPT users can now pull in specialized GPTs right into their chats. Just hit “@,” pick your GPT, and boom, it’s like adding a new brain to the conversation. They’ve even set up a GPT Store (like an App Store) to make finding and using these GPTs easy. But, not many folks are using them yet, and those who do, it’s dropping. Plus, they hit a bit of a snag with some not-so-great GPTs slipping through, which they’re cleaning up.

Key Points:
  • Customizable Convos: This new feature lets you tailor your ChatGPT conversations with specific GPTs for whatever you need.
  • GPT Store: A marketplace to grab these GPTs, designed to be user-friendly even for non-coders.
  • Challenges Ahead: Adoption’s been slow, and there’s been a bit of trouble with moderation, but OpenAI’s not backing down, planning to let creators earn money from their GPTs soon.

The idea’s solid: more personalized, useful chats. But, they’ve got some hurdles to clear, especially getting more users on board and keeping the GPT Store clean. Still, with plans to monetize, there’s a clear path forward for developers and users alike.


Mistral

Miqu-1-70b” got leaked online and everyone’s talking about it possibly beating GPT-4. The head of Mistral said it was an old model accidentally leaked by someone they work with. But here’s the kicker: they’re working on a new version that might just outdo GPT-4.


Key Points:
  • Leaky Boat: “Miqu-1-70b” has everyone buzzing about it possibly taking on GPT-4.
  • Inside Job: The boss over at Mistral says, oops, it was an older model that got out by accident.
  • Game On: They’re hinting they’ve got something even bigger brewing that could outdo GPT-4.

Mistral’s little accident is now big news, showing everyone just how intense the AI race is getting. And this leak? It might just shake things up, pushing the open-source AI scene into new places and turning up the heat on OpenAI.


Apple

Apple’s doing something big with the Vision Pro. They’ve got 600 new apps hitting the scene. This is about taking computing to a whole new level by blending the digital and real worlds like never before.

The Points:
  1. Wide Range: These apps are all over the map – games, work tools, learning, you name it. It’s about making computing not just something you do, but something you experience.
  2. Top-Notch Tech: The display on this thing is next-level. You’re not just looking at a screen; you’re in it. And you control it with your eyes, hands, voice – however you want.
  3. Big Changes: What Apple’s aiming for here is to change the game. How we watch, work, play, learn – it’s all going to be different with these apps and Vision Pro.

Apple’s Vision Pro is setting a new standard for digital interaction, merging the lines between the virtual and the real. It’s creating experiences that change how we see and interact with the world around us.


Neuralink

Elon Musk’s Neuralink just hit a big milestone: they’ve put their brain-computer interface device into a person for the first time. The patient seems to be doing just fine. Neuralink’s big idea is to let people with serious paralysis use tech like computers and phones just by thinking. They’re calling this brain implant Telepathy, aiming to help folks with conditions like ALS communicate or even use social media directly with their minds.

Key Points:

  1. First Human Trial: Neuralink’s moved from experiments to actually implanting a device in a human, showing they’re on track towards making this tech a reality.
  2. The Goal: The tech is all about translating what’s in your brain into commands for devices, without moving a muscle.
  3. Not Alone: Neuralink’s not the only one in this race. Companies like Synchron and Blackrock Neurotech are also pushing the boundaries of what’s possible with brain-computer interfaces.

Neuralink’s stepping into new territory, blending mind and machine in ways we’ve only dreamed of. This first human trial is a big deal, showing Musk’s vision of merging humans with AI isn’t just sci-fi fantasy anymore.

In Summary

This week has shown us just how fast the world of AI is evolving. Google’s Gemini is setting new standards in versatility, Apple’s Vision Pro apps are redefining user interaction, and Neuralink is pushing the boundaries of what’s possible with neurotechnology. Each of these developments not only highlights the rapid advancements in AI but also hints at the transformative impact these technologies could have on our everyday lives. The future of AI is here and now, and it’s more exciting than ever. Stay tuned for more updates.

Last Week in AI: Episode 17 Read More »