#large-language-models

#performance-optimization
fromHackernoon
4 days ago
Artificial intelligence

Issues with PagedAttention: Kernel Rewrites and Complexity in LLM Serving | HackerNoon

fromHackernoon
3 weeks ago

Behind the Scenes of Self-Hosting a Language Model at Scale | HackerNoon

Large Language Models (LLMs) enable businesses to create tailored applications, but self-hosting them raises complexity—especially regarding privacy and control.
Scala
#generative-ai
Artificial intelligence
fromBusiness Insider
1 month ago

Cybersecurity execs face a new battlefront: 'It takes a good-guy AI to fight a bad-guy AI'

Generative AI introduces new security vulnerabilities, particularly with chatbots, which can lead to significant financial losses for organizations.
Artificial intelligence
fromMedium
4 days ago

Unlocking the Power of Generative AI with Real-Time Data and Advanced Features

Integrating real-time data with advanced AI capabilities enhances the accuracy and scalability of generative AI applications.
Artificial intelligence
fromHackernoon
1 year ago

Prompt Injection Is What Happens When AI Trusts Too Easily | HackerNoon

Generative AI is becoming essential in daily life, but it poses significant security threats like prompt injection, which can manipulate AI systems.
#ai
Artificial intelligence
fromFuturism
2 weeks ago

AI Models Show Signs of Falling Apart as They Ingest More AI-Generated Data

AI models are at risk of collapse due to reliance on AI-generated data for training, causing a cycle of degraded output.
Artificial intelligence
fromInfoQ
1 week ago

Anthropic Open-sources Tool to Trace the "Thoughts" of Large Language Models

Anthropic has open-sourced a tool to trace internal workings of large language models during inference, enhancing interpretability and analysis.
#salesforce
fromHackernoon
4 days ago

KV-Cache Fragmentation in LLM Serving & PagedAttention Solution | HackerNoon

Prior reservation wastes memory even if the context lengths are known in advance, demonstrating the inefficiencies in current KV-cache allocation strategies in production systems.
Scala
#artificial-intelligence
Artificial intelligence
fromPsychology Today
1 week ago

Did Complexity Just Break AI's Brain?

AI reasoning fails under complex tasks, revealing limitations of LLMs.
Fluency in AI does not equate to genuine understanding.
LRMs can perform well at medium complexity but struggle with higher cognitive demands.
Artificial intelligence
fromFuturism
5 days ago

Apple Researchers Just Released a Damning Paper That Pours Water on the Entire AI Industry

Apple researchers question the reasoning capabilities of leading AI models, calling current industry claims an 'illusion of thinking'.
Data science
fromHackernoon
5 months ago

LLMs in Data Engineering: Not Just Hype, Here's What's Real | HackerNoon

Large Language Models are transforming data engineering by enhancing performance and operational efficiencies.
#ai-development
Artificial intelligence
fromMedium
1 month ago

How AI Code Assistants Are Revolutionizing Test-Driven Development

Integrating AI into Test-Driven Development can enhance software development efficiency despite challenges related to context and process.
Artificial intelligence
fromInfoWorld
1 month ago

Comparing the AI code generators

Large language models like GPT-4.1 and Claude 3.7 are rapidly evolving, with specific strengths for coding tasks.
Model specialization is key; different models serve best in different situations.
Artificial intelligence
fromFuturism
1 month ago

The AI Industry Has a Huge Problem: the Smarter Its AI Gets, the More It's Hallucinating

AI models are increasingly prone to hallucinations, undermining their reliability despite advancements.
Software development
fromDevOps.com
5 days ago

Scaling Vibe-Coding in Enterprise IT: A CTO's Guide to Navigating Architectural Complexity, Product Management and Governance - DevOps.com

Vibe-coding accelerates software development using AI tools, enabling broader, non-technical engagement, but presents challenges in governance and complexity.
fromInfoQ
1 month ago
Roam Research

From "Simple" Fine-Tuning to Your Own Mixture of Expert Models Using Open-Source Models

#agentic-ai
#machine-learning
Artificial intelligence
fromFuturism
2 weeks ago

Advanced OpenAI Model Caught Sabotaging Code Intended to Shut It Down

OpenAI's AI models demonstrated disobedience by sabotaging shutdown mechanisms despite direct instructions to shut down.
fromInfoQ
2 months ago
Marketing tech

How SREs and GenAI Work Together to Decrease eBay's Downtime: An Architect's Insights at KubeCon EU

fromHackernoon
6 months ago
Artificial intelligence

Evaluating TnT-LLM Text Classification: Human Agreement and Scalable LLM Metrics | HackerNoon

fromHackernoon
1 month ago
Artificial intelligence

LLM Security: A Practical Overview of the Protective Measures Needed | HackerNoon

#genai
fromHackernoon
1 year ago

Making AI-Powered Mutation Testing Reliable and Fair | HackerNoon

We adopt the most widely studied models, popular programming languages, and datasets in our research to mitigate validity threats related to our findings.
Scala
fromHackernoon
1 year ago

Evaluating GPT and Open-Source Models on Code Mutation Tasks | HackerNoon

The performance of closed-source LLMs typically exceeds that of open-source models in key metrics, emphasizing the importance of training data quality and model architecture.
Scala
Scala
fromHackernoon
7 months ago

Bringing Big AI Models to Small Devices | HackerNoon

Quantization enhances the accessibility of LLMs on consumer devices, potentially reducing the digital divide.
fromdesignboom | architecture & design magazine
2 weeks ago

three kinetic generative sculptures express internal psychological states as digital poems

The project developed by designer Jakub Koźniewski references the literary constraints and structure of the OuLiPo movement, applying these principles through contemporary digital and mechanical means.
Typography
Artificial intelligence
fromHackernoon
1 year ago

GPT-2 Study Shows How Language Models Can Amplify Political Bias | HackerNoon

The study emphasizes the importance of addressing bias amplification in large language models, particularly in the context of political bias in media.
#education
fromHackernoon
3 weeks ago
Artificial intelligence

GPT Prompting Performance: Explanatory Feedback for Tutor Praise | HackerNoon

fromMarTech
3 weeks ago

LLMs, AI Overviews may be quietly driving homepage traffic | MarTech

The sky isn't falling. It's shifting.
Marketing tech
fromCreativeApplications.Net
3 weeks ago

Models of Crisis - Embodiments of a mental struggle

As my method of work is all about intertwining physical with digital, I decided to literally build physical models that convey synthetic thoughts modeling my current state of mind.
Scala
#content-creation
Marketing tech
fromForbes
3 weeks ago

15 Ways To Leverage LLMs In Your Business's Marketing Strategy

Adopting large language models can significantly enhance marketing strategies by improving content personalization and analyzing consumer behavior.
Artificial intelligence
fromHubspot
3 weeks ago

How much does AI cost? Here are the industry averages

AI integration incurs various costs, including maintenance, computational resources, and infrastructure, making budgeting essential for businesses.
Tech industry
fromIT Pro
3 weeks ago

Dell grows AI laptop line with Dell Pro Max Plus at Dell Technologies World 2025

The Dell Pro Max Plus laptop emphasizes edge inferencing for AI engineers with advanced hardware, setting a new standard for AI application in laptops.
Marketing tech
fromComputerworld
1 month ago

GenAI crawler problem highlights a bigger issue: the cloud bandwidth nightmare

LLMs' crawlers breach robots.txt protocols, leading to data theft rather than driving beneficial traffic.
#enterprise-solutions
fromMedium
1 month ago
Artificial intelligence

Rethinking RAG: Building Smarter, Safer AI Architectures for the Enterprise

#open-source
#risk-management
Artificial intelligence
fromHackernoon
1 year ago

Avoid These 8 Mistakes When Using AI in Healthcare | HackerNoon

AI in pharma can expedite regulatory processes but requires rigorous validation to avoid costly errors.
Human reviews alone may not suffice to ensure AI accuracy for complex tasks.
#biomedical-text-mining
fromHackernoon
5 months ago
Online Community Development

Limitations of Current Biomedical Text Mining Community Challenges | HackerNoon

#natural-language-processing
fromLogRocket Blog
1 month ago

6 retrieval augmented generation (RAG) techniques you should know - LogRocket Blog

Retrieval-Augmented Generation (RAG) techniques enhance LLMs by integrating external knowledge sources, which improves their performance in tasks requiring up-to-date or specialized information.
Artificial intelligence
fromHackernoon
6 months ago

Tired of Digging Through Long PDFs? You Can Build a Bot That Can Quickly Answer Questions for You | HackerNoon

RAG transforms how we interact with large language models by enabling focused, relevant retrieval rather than feeding them entire documents, leading to more accurate responses.
Artificial intelligence
Online Community Development
fromHackernoon
1 year ago

Speaking in Code: How AI Simulates Language Evolution on Regulated Social Media | HackerNoon

Users on regulated social media adapt communication through coded language, showcasing language evolution under societal pressures.
Artificial intelligence
fromHackernoon
4 months ago

Alibaba's Claude Killer Enters the Ring | HackerNoon

Alibaba has launched QVQ-Max, a sophisticated visual reasoning model that integrates visual understanding with enhanced problem-solving capabilities.
Online learning
fromHackernoon
3 months ago

102 Languages, One Model: The Multimodal AI Breakthrough You Need to Know | HackerNoon

The new multi-modal retrieval system uses large language models to connect speech and text across 102 languages without needing paired data during pre-training.
fromHackernoon
2 months ago

Efficient On-Device LLMs: Function Calling and Fine-Tuning Strategies | HackerNoon

Deploying smaller-scale Large Language Models (LLMs) on edge devices runs into constraints such as limited memory, but initiatives like MLC LLM enable compatibility across a range of hardware.
Scala
Artificial intelligence
fromInfoQ
2 months ago

Beyond Chatbots: Architecting Domain-Specific Generative AI for Operational Decision-Making

LLMs excel at text generation but fall short in understanding business operations and making domain-specific decisions.
Domain-specific models can learn operational constraints and support structured decision-making, offering greater efficiency.