#speech-recognition

[ follow ]
#natural-language-processing
Artificial intelligence
fromTechCrunch
2 months ago

MLCommons and Hugging Face team up to release massive speech data set for AI research | TechCrunch

MLCommons and Hugging Face released a large public domain voice recording dataset for AI research, promoting global speech technology development.
fromInfoQ
6 months ago
Data science

University of Chinese Academy of Sciences Open-Sources Multimodal LLM LLaMA-Omni

LLaMA-Omni outperforms traditional baseline models in speech and text processing while requiring less training data and compute resources.
fromHackernoon
8 months ago
Artificial intelligence

Datasets and Evaluation Define the Robustness of Speech Language Models | HackerNoon

The article discusses the methods and datasets used for training and evaluating speech-language models (SLMs) against adversarial attacks.
fromHackernoon
8 months ago
Data science

AccentFold: Enhancing Accent Recognition - AccentFold | HackerNoon

AccentFold enhances speech recognition for diverse African accents, improving model accuracy for various dialects.
fromHackernoon
3 weeks ago
Scala

Why Our Tiny Training Set Beat Giants in Cross-Lingual Speech Retrieval | HackerNoon

The proposed DE model excels at speech-to-text (S2T) retrieval, outperforming existing models despite limited training data.
Artificial intelligence
fromTechCrunch
2 months ago

MLCommons and Hugging Face team up to release massive speech data set for AI research | TechCrunch

MLCommons and Hugging Face released a large public domain voice recording dataset for AI research, promoting global speech technology development.
fromInfoQ
6 months ago
Data science

University of Chinese Academy of Sciences Open-Sources Multimodal LLM LLaMA-Omni

LLaMA-Omni outperforms traditional baseline models in speech and text processing while requiring less training data and compute resources.
fromHackernoon
8 months ago
Artificial intelligence

Datasets and Evaluation Define the Robustness of Speech Language Models | HackerNoon

The article discusses the methods and datasets used for training and evaluating speech-language models (SLMs) against adversarial attacks.
fromHackernoon
8 months ago
Data science

AccentFold: Enhancing Accent Recognition - AccentFold | HackerNoon

AccentFold enhances speech recognition for diverse African accents, improving model accuracy for various dialects.
fromHackernoon
3 weeks ago
Scala

Why Our Tiny Training Set Beat Giants in Cross-Lingual Speech Retrieval | HackerNoon

The proposed DE model excels at speech-to-text (S2T) retrieval, outperforming existing models despite limited training data.
more#natural-language-processing
#openai
fromHackernoon
3 years ago
JavaScript

Building a Voice Transcription and Translation App with OpenAI Whisper and Streamlit | HackerNoon

Using Streamlit and OpenAI's Whisper, users can easily record and transcribe speech to text, enhancing interactive web app functionalities.
fromBusiness Insider
8 months ago
Privacy professionals

ChatGPT appears to be getting confused again - this time in Welsh

OpenAI's ChatGPT is facing glitches causing it to respond in incorrect languages due to issues with its speech recognition tool, Whisper.
fromHackernoon
3 years ago
JavaScript

Building a Voice Transcription and Translation App with OpenAI Whisper and Streamlit | HackerNoon

Using Streamlit and OpenAI's Whisper, users can easily record and transcribe speech to text, enhancing interactive web app functionalities.
fromBusiness Insider
8 months ago
Privacy professionals

ChatGPT appears to be getting confused again - this time in Welsh

OpenAI's ChatGPT is facing glitches causing it to respond in incorrect languages due to issues with its speech recognition tool, Whisper.
more#openai
NYC startup
fromTechzine Global
1 month ago

How Juvoly built its own AI speech recognition to beat OpenAI's Whisper

Juvoly harnesses AI to enhance speech recognition tailored for non-English medical conversations, addressing significant shortcomings of existing models like OpenAI's Whisper.
#apple
Apple
fromwww.bbc.com
2 months ago

Apple AI tool transcribed the word 'racist' as 'Trump'

Apple is addressing a glitch in its speech-to-text tool after it mistakenly transcribed "racist" as "Trump".
Apple
fromwww.mercurynews.com
2 months ago

Apple to fix iPhone dictation glitch that suggests replacing the word racist' with Trump'

Apple is fixing a dictation bug that suggests 'Trump' when words with 'R' consonants are spoken.
Apple
fromThe Verge
2 months ago

Apple is fixing a voice dictation bug that substitutes "Trump" for "racist"

Apple's dictation feature has a bug that substitutes "Trump" for "racist."
The company is addressing the glitch related to speech recognition in iPhones.
Apple
fromwww.bbc.com
2 months ago

Apple AI tool transcribed the word 'racist' as 'Trump'

Apple is addressing a glitch in its speech-to-text tool after it mistakenly transcribed "racist" as "Trump".
Apple
fromwww.mercurynews.com
2 months ago

Apple to fix iPhone dictation glitch that suggests replacing the word racist' with Trump'

Apple is fixing a dictation bug that suggests 'Trump' when words with 'R' consonants are spoken.
Apple
fromThe Verge
2 months ago

Apple is fixing a voice dictation bug that substitutes "Trump" for "racist"

Apple's dictation feature has a bug that substitutes "Trump" for "racist."
The company is addressing the glitch related to speech recognition in iPhones.
more#apple
Artificial intelligence
fromTNW | Deep-Tech
2 months ago

'Sorry, I didn't get that': AI misunderstands some people's words more than others

Automatic speech recognition systems struggle to understand diverse speech patterns and accents, often leading to frustrations for users.
#meta
fromArs Technica
3 months ago
Artificial intelligence

Meta takes us a step closer to Star Trek's universal translator

Meta's Seamless translation system translates speech in real-time across 36 languages while preserving voice and emotional tone.
fromTechCrunch
2 months ago
Artificial intelligence

Meta launches new program to improve speech and translation AI | TechCrunch

Meta is partnering with UNESCO to develop openly available AI by collecting diverse speech recordings and transcriptions.
fromArs Technica
3 months ago
Artificial intelligence

Meta takes us a step closer to Star Trek's universal translator

Meta's Seamless translation system translates speech in real-time across 36 languages while preserving voice and emotional tone.
fromTechCrunch
2 months ago
Artificial intelligence

Meta launches new program to improve speech and translation AI | TechCrunch

Meta is partnering with UNESCO to develop openly available AI by collecting diverse speech recordings and transcriptions.
more#meta
#safety-alignment
fromHackernoon
8 months ago
Artificial intelligence

SLMs Outperform Competitors Yet Suffer Rapid Adversarial Jailbreaks | HackerNoon

SLM models show superior safety alignment and helpfulness compared to traditional models, enhancing understanding of spoken language.
fromHackernoon
8 months ago
Artificial intelligence

SpeechVerse Unites Audio Encoder and LLM for Superior Spoken QA | HackerNoon

The SpeechVerse architecture combines an audio encoder with language models to enhance audio input processing.
fromHackernoon
8 months ago
Artificial intelligence

SLMs Outperform Competitors Yet Suffer Rapid Adversarial Jailbreaks | HackerNoon

SLM models show superior safety alignment and helpfulness compared to traditional models, enhancing understanding of spoken language.
fromHackernoon
8 months ago
Artificial intelligence

SpeechVerse Unites Audio Encoder and LLM for Superior Spoken QA | HackerNoon

The SpeechVerse architecture combines an audio encoder with language models to enhance audio input processing.
more#safety-alignment
fromHackernoon
3 months ago
Miscellaneous

Ablation Study Reveals the Role of Semantic & Acoustic Prompts in SEAMLESSEXPRESSIVELM's Performance | HackerNoon

Chain-of-thought prompting enhances model performance by improving semantic preservation during translation.
#language-learning
fromHackernoon
8 months ago
JavaScript

How to Create a Pronunciation Assessment App (Part 1) | HackerNoon

The tutorial focuses on creating a pronunciation app for German using JavaScript and APIs.
fromZDNET
7 months ago
Online learning

Learn a new language with Babbel, now 69% off

Babbel simplifies language learning with short lessons and a focus on conversation, making it feasible for busy individuals.
fromZDNET
5 months ago
Online learning

Save 69% on a Babbel subscription to learn a new language. Here's how

Babbel offers an accessible and effective way to learn a language through short lessons and practical conversation skills.
fromZDNET
8 months ago
Online learning

Buy a Babbel subscription for 76% off

Lifetime subscription to Babbel Language Learning offers 14 languages and 10,000+ hours of education at a 76% discount.
fromZDNET
9 months ago
Online learning

Get a Babbel subscription for 76% off right now

Lifetime subscription to Babbel Language Learning on sale for $140 (76% off) with 14 languages and 10,000+ hours of online education.
fromHackernoon
8 months ago
JavaScript

How to Create a Pronunciation Assessment App (Part 1) | HackerNoon

The tutorial focuses on creating a pronunciation app for German using JavaScript and APIs.
fromZDNET
7 months ago
Online learning

Learn a new language with Babbel, now 69% off

Babbel simplifies language learning with short lessons and a focus on conversation, making it feasible for busy individuals.
fromZDNET
5 months ago
Online learning

Save 69% on a Babbel subscription to learn a new language. Here's how

Babbel offers an accessible and effective way to learn a language through short lessons and practical conversation skills.
fromZDNET
8 months ago
Online learning

Buy a Babbel subscription for 76% off

Lifetime subscription to Babbel Language Learning offers 14 languages and 10,000+ hours of education at a 76% discount.
fromZDNET
9 months ago
Online learning

Get a Babbel subscription for 76% off right now

Lifetime subscription to Babbel Language Learning on sale for $140 (76% off) with 14 languages and 10,000+ hours of online education.
more#language-learning
fromPycoders
7 months ago
Python

PyCoder's Weekly | Issue #647

Learn to use NumPy's where() function for conditional selections in arrays.
Combining both R and Python can optimize data science workflows.
JavaScript
fromCodeProject
8 months ago

Complete Voice Interaction with ChatGPT

The project effectively combines speech recognition and TTS to facilitate uninterrupted interaction with ChatGPT, enhancing user experience.
fromHackernoon
8 months ago
Artificial intelligence

AccentFold: Enhancing Accent Recognition - Conclusion, Limitations, and References | HackerNoon

AccentFold enhances speech recognition for African accented speech by utilizing accent embeddings based on linguistic relationships, showing a 3.5% WER improvement.
fromZDNET
8 months ago
Online learning

Buy a Babbel subscription for 76% off. Here's how

Babbel Language Learning offers a lifetime subscription with access to 14 languages and 10,000+ hours of online education for $140, aiding busy learners with short lesson plans.
Writing
fromtime.com
9 months ago

A Neurological Disorder Stole Her Voice. Jennifer Wexton Took It Back With AI on the House Floor

Jennifer Wexton regained her voice using AI after a rare neurological disorder affected her speech.
The AI program helped Wexton deliver a speech on the House floor, marking a historic moment in using AI for speeches.
Wexton's experience highlights the importance of Disability Pride Month and the impact of technology in aiding individuals with disabilities.
[ Load more ]