In today’s email:

  • 👼 This AI learned language by seeing the world through a baby’s eyes

  • 🎼 Meta found a way to secretly watermark deep fake audio

  • 🤗 Hugging Face launches open-source AI assistant maker to rival OpenAI’s custom GPTs

  • 🧰 13 new AI-powered tools and resources. Make sure to check the online version for the full list of tools.

Top News

An AI model has learned to recognize words like ‘crib’ and ‘ball’ by studying videos recorded from the perspective of an 18-month-old baby named Sam. This unique approach, detailed in a study published in Science on February 1, 2024, utilized 61 hours of footage from a head-mounted camera to train a neural network. This method of learning through the visual and auditory experiences of a baby provides new insights into human language acquisition, challenging the notion that innate knowledge about language is necessary for learning.

The AI was exposed to 250,000 words and corresponding images from Sam’s daily activities, learning through a process called contrastive learning to associate specific words with images. In tests, the AI successfully matched words to images with a 62% success rate, which is significantly higher than chance and comparable to other AI models trained on much larger datasets. However, its ability to generalize this knowledge to unseen examples varied, performing best with objects that appeared frequently in the training data and had consistent appearances.

The study’s findings offer evidence against theories that suggest language acquisition requires special mechanisms beyond the ability to form associations between different sensory inputs. However, the reliance on data from a single child’s experiences raises questions about the generalizability of the results. Nonetheless, the research highlights the potential of AI to mimic and provide insights into human learning processes, though acknowledging limitations such as the AI’s inability to replicate the tactile and interactive experiences vital to a real baby’s learning.

AudioSeal is a new system by Facebook Research designed to detect AI-generated deepfake audio through imperceptible watermarking. As AI voice synthesis improves, distinguishing between real and fake human speech has become challenging, leading to potential misuse like voice cloning and deepfakes. AudioSeal addresses this by embedding watermarks in synthetic speech, enabling the identification of manipulated audio.

Traditional passive detection methods are becoming ineffective against advanced synthesis technologies. In contrast, AudioSeal uses an active approach, marking generated voices to adapt to technological advancements. It comprises a generator for embedding watermarks and a detector for identifying watermarked audio, offering precise localization and robustness against audio distortions.

AudioSeal surpasses previous models in detection speed and efficiency, providing generalizability, precise manipulation localization, and robustness against editing techniques. However, it also faces challenges such as confidentiality, ethical concerns, and the need for standardization and user consent. Despite its advancements, AudioSeal is part of a broader effort to tackle the evolving challenges of synthetic media, emphasizing the importance of authenticity and ethical practices.

Read the full paper Here.

This is Adeus, the Open Source AI Wearable device – and in this repo, you will be guided on how to set up your own! From buying the hardware (~$100, and it will be cheaper once we finish the Raspberry PI Zero version) to setting up the backend, and the software, and start using your wearable!

Adeus consists of 3 parts:

  1. A mobile / web app: an interface that lets the user to interact with their assistant and data via chat.

  2. Hardware device (Currently Coral AI, but soon a Rasberry-Pi Zero W worth $15): this will be the wearable that will record everything, and send it to the backend to be processed

  3. Supabase: Our backend, and datavase, where we will process and store data, and interact with LLMs. Supabase is an open-source Firebase alternative, a “backend-as-a-service” – that allows you to set up a Postgres database, Authentication, Edge Functions, Vector embeddings, and more – for free (at first) and at extreme ease!

Hugging Face, a New York City-based startup known for its open-source AI code repository has launched a new product called Hugging Chat Assistants. This product allows users to create their own AI chatbots for free, offering a similar service to OpenAI’s custom GPT Builder but without the associated costs. Unlike OpenAI’s offerings, which rely on proprietary models like GPT-4, Hugging Chat Assistants can be powered by a variety of open-source large language models (LLMs), such as Mistral’s Mixtral or Meta’s Llama 2. Hugging Face’s approach emphasizes user customization and the freedom to choose from a wide range of models. Additionally, they have created a central repository where users can share and access customized Assistants. While some in the AI community see Hugging Chat Assistants as superior to GPTs due to their customizability and free access, there are limitations such as the lack of web search capabilities and automatic logo generation, features that OpenAI’s models currently offer. The launch of Hugging Chat Assistants is seen as a significant step in the open-source community’s efforts to offer competitive alternatives to proprietary AI models like those of OpenAI.

Other stuff

Superpower ChatGPT now supports voice 🎉

Text-to-Speech and Speech-to-Text. Easily have a conversation with ChatGPT on your computer

Superpower ChatGPT Extension on Chrome
Superpower ChatGPT Extension on Firefox
Tools & LinkS

Editor’s Pick ✨

RawDog – Generate and auto-execute Python scripts in the CLI

Leap AI – a visual UI tool for building AI workflows with ChatGPT without writing a single line of code.

Brainbase – Use your workforce only where they’re needed

Alvea – An adaptive AI assistant creating unique user interfaces personalized to your tasks just in time

Sensay – Dementia care with lifelike AI replicas

Samwell.Ai – Empowering your academic excellence with AI

Glide – An open high-performance model gateway for your GenAI apps

Podstellar – Transcribe YouTube videos in under 3 minutes

Enhanced GPT Store – Categories, search, sorting and more…

Unclassified 🌀 

⚡️ Be the Highlight of Someone’s Day – Think a friend would enjoy this? Go ahead and forward it. They’ll thank you for it!

Hope you enjoyed today’s newsletter

⚡️ Join over 200,000 people using the Superpower ChatGPT extension on Chrome and Firefox.

Superpower ChatGPT Extension on Chrome
Superpower ChatGPT Extension on Firefox

Source link

What's Your Reaction?

hate hate
confused confused
fail fail
fun fun
geeky geeky
love love
lol lol
omg omg
win win
The Obsessed Guy
Hi, I'm The Obsessed Guy and I am passionate about artificial intelligence. I have spent years studying and working in the field, and I am fascinated by the potential of machine learning, deep learning, and natural language processing. I love exploring how these technologies are being used to solve real-world problems and am always eager to learn more. In my spare time, you can find me tinkering with neural networks and reading about the latest AI research.


Your email address will not be published. Required fields are marked *