Meta (formerly Facebook) is introducing its first artificial intelligence offering since the AI generator industry exploded in late 2022.
The brand’s text-to-audio generator, called Voicebox, is expected to be the voice equivalent of ChatGPT, which processes text prompts into detailed written results, and Dall-E, which develops realistic artwork. Voicebox, in turn, will be able to take text prompts and produce audio clips, according to Engadget.
Meta trained the new generator on over “50,000 hours of unfiltered audio,” including public domain speech and transcripts in English, French, Spanish, German, Polish, and Portuguese. As a result, Voicebox is prepared to produce conversational-sounding speech in a variety of available languages. Meta also claims its model has a one percent error rate degradation in comparison to other models.
According to Meta researchers, the model was trained by having it predict blocks of speech within a transcript instead of having to develop a body of work from scratch. The tool can also edit audio clips to remove unwanted noise or correct misspoken words, much as editing software such as Adobe Photoshop does for still images.
Meta stated it doesn’t currently plan to release the Voicebox app or source code to the public due to “the potential risks of misuse.” This is understandable, as the Federal Bureau of Investigation (FBI) recently issued a warning about the increasing use of deepfake content in crimes, including extortion, blackmail, and harassment.
The company has released audio samples alongside the research paper introducing the app. It also detailed potential future applications, including “patients with vocal cord damage, in-game NPCs, and digital assistants.”
Meta is in the interesting position of trying to keep up with current industry trends. Despite having several models of its Meta Quest VR headsets, the company appears to be setting aside its plans to develop its metaverse concept in favor of more AI innovation. Meanwhile, Apple recently introduced its first Vision Pro headset and is investing in virtual reality. So far, Apple hasn’t shown any major interest in AI.