#model-performance

#machine-learning
fromHackernoon
1 year ago
Artificial intelligence

The Link Between Concept Frequency and AI Performance, Seen Through Images and Words | HackerNoon

fromMedium
1 month ago
Artificial intelligence

Two Indispensable Tools for Measuring the Quality of AI Systems

fromHackernoon
1 year ago
Online learning

Enhancing Rhetorical Role Labeling with Training-Time Neighborhood Learning | HackerNoon

Artificial intelligence
fromHackernoon
1 year ago

How Dataset Diversity Impacts AI Model Performance | HackerNoon

Pretraining data diversity significantly influences model performance, particularly in generalization and predictive capabilities.
fromHackernoon
6 months ago

Contextualizing SUTRA: Advancements in Multilingual & Efficient LLMs | HackerNoon

Advancements in Large Language Models emphasize the importance of multilingual support to address global linguistic diversity.
#ai-evaluation
fromHackernoon
1 year ago
Artificial intelligence

AI Still Can't Explain a Joke-or a Metaphor-Like a Human Can | HackerNoon

Artificial intelligence
fromInfoWorld
3 months ago

Vector Institute aims to clear up confusion about AI model performance

DeepSeek and OpenAI's o1 models excel in performance, yet AI models still face significant challenges across various tasks.
#ai
fromInfoQ
1 month ago
Artificial intelligence

Mistral AI Releases Magistral, Its First Reasoning-Focused Language Model

fromComputerworld
3 months ago
Artificial intelligence

OpenAI's new models hallucinate more than the old ones

AI models increasingly produce hallucinations, with newer versions being more prone to inaccuracies.
fromHackernoon
3 months ago
Artificial intelligence

Reconstruction Evaluations Across Varying Amounts of Training Data: Mindeye2 | HackerNoon

Model performance improves with increased training data, particularly in specialized contexts such as medical AI.
Artificial intelligence
fromTechCrunch
1 month ago

DeepSeek may have used Google's Gemini to train its latest model | TechCrunch

DeepSeek's R1 model may have been trained on outputs from Google's Gemini, raising ethical concerns regarding data sourcing.
Scala
fromHackernoon
9 months ago

What Makes Code LLMs Accurate? | HackerNoon

Pass@1 rates for Lua programming tasks show that quantization level affects model performance, with lower-bit models hit hardest.
#quantization
fromHackernoon
9 months ago

The V-Shaped Mystery of Inference Time in Low-Bit Code Models | HackerNoon

Higher precision results in longer inference times, especially for incorrect solutions.
Longer inference times do not guarantee improved performance across different models.
Online learning
fromHackernoon
1 year ago

Fine-tuned GPT-3.5 Performance for Explanatory Feedback | HackerNoon

Fine-tuning GPT-3.5 enhances its ability to identify praise in tutoring responses even with limited data.
Artificial intelligence
fromHackernoon
3 months ago

How LightCap Sees and Speaks: Mobile Magic in Just 188ms Per Image | HackerNoon

LightCap model achieves real-time image processing on mobile devices, meeting efficiency demands for practical applications.
Software development
fromInfoQ
2 months ago

Windsurf Launches SWE-1 Family of Models for Software Engineering

Windsurf's SWE-1 models support diverse software engineering tasks while improving performance and user experience.
fromHackernoon
7 months ago

Where Glitch Tokens Hide: Common Patterns in LLM Tokenizer Vocabularies | HackerNoon

The study identifies a pattern of untrained tokens across various model families, revealing inefficiencies in tokenizer design.
fromTechCrunch
4 months ago

Researchers say they've discovered a new method of 'scaling up' AI, but there's reason to be skeptical | TechCrunch

Experts are skeptical of the newly proposed AI scaling law called 'inference-time search' despite its potential for improving model performance.
Scala
fromHackernoon
4 months ago

The Future of AI Compression: Smarter Quantization Strategies | HackerNoon

Impact-based parameter selection outperforms magnitude-based criteria in improving quantization for language models.
Artificial intelligence
fromFuturism
5 months ago

OpenAI May Have Really Screwed Up With GPT-4.5

OpenAI's GPT-4.5 is perceived as underwhelming despite claims of being the 'largest and most knowledgeable model'.
High costs and slow performance contribute to skepticism regarding GPT-4.5's value.