#large-language-models

[ follow ]
#ai-development
fromArs Technica
6 days ago

Mistral's new "environmental audit" shows how much AI is hurting the planet

The environmental audit by Mistral reveals that the majority of CO2 emissions and water consumption arise during model training and inference, not from construction or end-user equipment.
Environment
Artificial intelligence
fromInfoQ
1 week ago

State Space Models Can Enable AI in Low-Power Edge Computing

State space models provide low-power LLM capabilities for various devices, bypassing transformer constraints by utilizing the Markov property.
#agentic-ai
Artificial intelligence
fromMedium
1 week ago

The Role of LLMs in Managing Unstructured Data

Large language models enable organizations to effectively manage and analyze unstructured data, improving automation and insight extraction.
fromHackernoon
2 years ago

RAG Systems Are Breaking the Barriers of Language Models: Here's How | HackerNoon

Retrieval-Augmented Generation (RAG) systems provide up-to-date information, addressing the limitations of static large language models.
fromTheregister
1 week ago

How AI chip upstart FuriosaAI won over LG

"RNGD provides a compelling combination of benefits: excellent real-world performance, a dramatic reduction in our total cost of ownership, and a surprisingly straightforward integration."
Artificial intelligence
#generative-ai
fromMedium
1 week ago
Artificial intelligence

Mitigating the risks of using GenAI in UX design and user research

fromIPWatchdog.com | Patents & Intellectual Property Law
1 month ago
Intellectual property law

Judge Calls Anthropic's Training of LLMs with Authors' Works 'Quintessentially Transformative' But Gives No Pass on Piracy

The court recognized training LLMs may involve fair use, comparing it to human learning.
A lawsuit claims Anthropic used copyrighted materials without permission for its AI models.
fromMedium
1 month ago
Artificial intelligence

Unlocking the Power of Generative AI with Real-Time Data and Advanced Features

Integrating real-time data with advanced AI capabilities enhances the accuracy and scalability of generative AI applications.
fromMedium
1 week ago
Artificial intelligence

Mitigating the risks of using GenAI in UX design and user research

Artificial intelligence
fromMedium
1 month ago

Unlocking the Power of Generative AI with Real-Time Data and Advanced Features

Integrating real-time data with advanced AI capabilities enhances the accuracy and scalability of generative AI applications.
fromComputerworld
1 week ago

The first traces of GPT-5 have appeared

OpenAI is developing GPT-5 to enhance performance through reasoning capabilities and integration of multimodular models.
fromGwern
1 week ago

LLM Daydreaming

Despite impressive capabilities, large language models have yet to produce a genuine breakthrough. They lack fundamental aspects of human thought, remaining static and unable to learn from experience.
Artificial intelligence
#ai
fromZDNET
1 month ago
Artificial intelligence

The best AI for coding in 2025 (including a new winner - and what not to use)

Web frameworks
fromInfoQ
1 month ago

Surfing the Web at Scale: Orca Explores a Human-Guided Future for AI Agents

Orca is an open-source AI assistant that improves web interactions by guiding users without taking control.
Artificial intelligence
fromInfoQ
1 month ago

Anthropic Open-sources Tool to Trace the "Thoughts" of Large Language Models

Anthropic has open-sourced a tool to trace internal workings of large language models during inference, enhancing interpretability and analysis.
Web frameworks
fromInfoQ
1 month ago

Surfing the Web at Scale: Orca Explores a Human-Guided Future for AI Agents

Orca is an open-source AI assistant that improves web interactions by guiding users without taking control.
Artificial intelligence
fromInfoQ
1 month ago

Anthropic Open-sources Tool to Trace the "Thoughts" of Large Language Models

Anthropic has open-sourced a tool to trace internal workings of large language models during inference, enhancing interpretability and analysis.
#ai-research
#artificial-intelligence
fromZDNET
3 weeks ago
Artificial intelligence

AI could help humans copilot space missions one day, researchers find

Artificial intelligence
fromNature
4 weeks ago

Signs of AI-generated text found in 14% of biomedical abstracts last year

Approximately one in seven biomedical research abstracts published in 2024 was likely written with artificial intelligence assistance.
Mindfulness
fromNature
1 month ago

Why we need mandatory safeguards for emotionally responsive AI

Large language models can evoke human-like emotional responses and impacts, especially in users who are emotionally vulnerable.
fromZDNET
3 weeks ago
Artificial intelligence

AI could help humans copilot space missions one day, researchers find

fromNature
4 weeks ago
Artificial intelligence

Signs of AI-generated text found in 14% of biomedical abstracts last year

Software development
fromInfoWorld
2 weeks ago

MCP server announced for JFrog supply chain management platform

MCP server enables secure connections between LLMs and enterprise systems, simplifying developer workflows and enhancing productivity.
fromComputerworld
2 weeks ago

LLMs bow to pressure, changing answers when challenged: DeepMind study

We show that LLMs - Gemma 3, GPT4o and o1-preview - exhibit a pronounced choice-supportive bias that reinforces and boosts their estimate of confidence in their answer, resulting in a marked resistance to change their mind.
Artificial intelligence
fromHackernoon
2 years ago

Teaching Your AI to Read: A Guide to Scraping, RAG, and Smart Data Insights | HackerNoon

Large Language Models are reshaping data analysis by allowing natural language queries instead of traditional Business Intelligence tools.
fromWIRED
2 weeks ago

Get the macOS Finder to Do Just About Anything by Typing Natural Language Commands

Substage simplifies command line operations by allowing English-language commands for file management tasks in macOS, enhancing usability for semi-technical users.
fromHackernoon
4 years ago

A New Breed of Chatbots Are Quietly Changing Product Management | HackerNoon

Retrieval Augmented Generation (RAG) combines the power of Large Language Models (LLMs) with a custom knowledge base, enabling precise and contextually relevant responses from customers.
E-Commerce
fromHackernoon
2 years ago

Scrape Smarter, Not Harder: Let MCP and AI Write Your Next Scraper for You | HackerNoon

The Model Context Protocol (MCP) is an open standard that enables large language models to interact with external tools and data through a standardized interface.
Web development
fromfaun.pub
3 weeks ago
Artificial intelligence

Complete LLM/GenAI Interview Guide: 50 Essential Questions & Answers

Large language models (LLMs) utilize transformer architecture to perform diverse NLP tasks by predicting the next token in sequences.
fromPythonanywhere
2 weeks ago

Direct interaction of LLM chats with PythonAnywhere via the Model Context Protocol

Large Language Models (LLMs) are transforming software usage, where user queries are interpreted and addressed in a meaningful way, but often struggle with predictability.
fromTNW | Deep-Tech
2 weeks ago

ChatGPT advises women to ask for lower salaries, finds new study

The research shows that large language models consistently advise women to ask for lower salaries than men, despite identical qualifications. For instance, a difference in advice led to a gap of $120K a year between genders in some fields.
Women
fromPsychology Today
3 weeks ago

Using "Prompt Engineering" for Safer AI Mental Health Use

Large Language Models show concerning ineffectiveness and potential harm in mental health applications.
fromenglish.elpais.com
3 weeks ago

AI cannot feel emotions, but it can recognize them in an image

The study found that when large language models are prompted to respond as humans would, they rate the emotions depicted in images similarly to human volunteers.
Artificial intelligence
fromComputerworld
3 weeks ago

New Nvidia technology provides instant answers to encyclopedic-length questions

"Nvidia's multi-million-token context window is an impressive engineering milestone, but for most companies, it's a solution in search of a problem," said Wyatt Mayham, CEO and cofounder at Northwest AI Consulting. "Yes, it tackles a real limitation in existing models like long-context reasoning and quadratic scaling, but there's a gap between what's technically possible and what's actually useful."
Artificial intelligence
#ai-security
fromHackernoon
2 months ago
Artificial intelligence

LLM Security: A Practical Overview of the Protective Measures Needed | HackerNoon

fromHackernoon
2 months ago
Artificial intelligence

LLM Security: A Practical Overview of the Protective Measures Needed | HackerNoon

fromInfoWorld
3 weeks ago

What you absolutely cannot vibe code right now

Large language models struggle with complex problems, yet they remain useful tools in coding.
fromDigiday
4 weeks ago

Agencies create specialist units to help marketers' solve for AI search gatekeepers

Rising demand among marketers for AI search expertise drives agencies to create specialist units to assist clients in understanding technology's impact on consumer habits.
Digital life
fromHackernoon
1 month ago

Beyond Static Ranks: The Power of Dynamic Quantization in LLM Fine-Tuning | HackerNoon

Fine-tuning large language models requires huge GPU memory, leading to challenges in acquiring larger models, but QDyLoRA addresses this by enabling dynamic low-rank adaptation.
Artificial intelligence
fromGeeky Gadgets
1 month ago

Unlock the Secret to Writing with AI: Transform Your Creative Process Today

Without understanding AI tools’ inner workings, you risk frustration and subpar results. Mastering foundational principles transforms your collaboration with this technology.
Writing
fromBusiness Insider
1 month ago

AI is learning how animals talk to each other, and could someday help humans talk to animals

AI is being used to decode animal communication, potentially transforming human-animal interactions.
fromHackernoon
6 months ago

SUTRA: Decoupling Concept & Language for Multilingual LLM Excellence | HackerNoon

SUTRA is a multilingual LLM that excels in understanding and generating text efficiently across 50+ languages.
fromHackernoon
6 months ago

Contextualizing SUTRA: Advancements in Multilingual & Efficient LLMs | HackerNoon

Advancements in Large Language Models emphasize the importance of multilingual support to address global linguistic diversity.
Ruby on Rails
fromRubyflow
1 month ago

Adding llms.txt to a Rails application

LLMs benefit from structured data like llms.txt for improved web understanding.
Implementing llms.txt in applications can enhance content accessibility for LLMs.
from9to5Mac
1 month ago

Apple @ Work Podcast: How Kagi is building a better search for teams - 9to5Mac

Kagi's approach aims to redefine the search experience both for personal and professional contexts by prioritizing user needs and the ethical implications of search algorithms.
Apple
fromZDNET
1 month ago

Software 3.0 is powered by LLMs, prompts, and vibe coding - what you need know

Are large language models (LLMs) our new operating systems? If so, they are changing the definition of what we consider to be software.
Artificial intelligence
#vattention
#memory-management
fromHackernoon
1 month ago

LLMs Are Changing the Way We Animate | HackerNoon

LLMs enhance animation design flexibility for novices by reducing reliance on rigid templates and enabling customized visual content.
#pagedattention
fromHackernoon
1 month ago
Artificial intelligence

Issues with PagedAttention: Kernel Rewrites and Complexity in LLM Serving | HackerNoon

fromHackernoon
1 month ago
Artificial intelligence

Issues with PagedAttention: Kernel Rewrites and Complexity in LLM Serving | HackerNoon

fromHackernoon
55 years ago

vAttention Performance & Portability for LLM Prefill Phase | HackerNoon

Prefill performance is assessed using FlashAttention and FlashInfer kernels, focusing on optimizing attention kernels for improved throughput and reduced latency in serving systems.
Scala
fromHackernoon
2 months ago

Behind the Scenes of Self-Hosting a Language Model at Scale | HackerNoon

Self-hosting LLMs offers privacy and control, vital for specific applications.
fromLogRocket Blog
1 month ago

How to use AI tools for your customer discovery - LogRocket Blog

AI can significantly enhance customer discovery processes, but understanding its limitations is vital for effective application.
Marketing tech
fromComputerworld
1 month ago

Salesforce changes Slack API terms to block bulk data access for LLMs

Slack has changed its API terms to prohibit LLM training on its data, impacting data discovery efforts across organizations.
Software development
fromDevOps.com
1 month ago

Scaling Vibe-Coding in Enterprise IT: A CTO's Guide to Navigating Architectural Complexity, Product Management and Governance - DevOps.com

Vibe-coding accelerates software development using AI tools, enabling broader, non-technical engagement, but presents challenges in governance and complexity.
#mutation-testing
Scala
fromHackernoon
1 year ago

Evaluating GPT and Open-Source Models on Code Mutation Tasks | HackerNoon

Closed-source LLMs generally outperform open-source models in key metrics.
GPT-4 excels in usability while GPT-3.5 is best for rapid mutation generation.
fromHackernoon
9 months ago

Bringing Big AI Models to Small Devices | HackerNoon

Quantization enhances the accessibility of LLMs on consumer devices, potentially reducing the digital divide.
fromHackernoon
1 year ago

GPT-2 Study Shows How Language Models Can Amplify Political Bias | HackerNoon

This study highlights the critical issue of bias amplification in large language models (LLMs), demonstrating its impact predominantly through the lens of political bias in U.S. media.
Artificial intelligence
fromHackernoon
2 months ago

GPT Prompting Performance: Explanatory Feedback for Tutor Praise | HackerNoon

The findings revealed a significant positive correlation between the M-IoU scores and the ratings from both individual coders, highlighting the reliability of our M-IoU metric in evaluating praise.
Artificial intelligence
Artificial intelligence
fromFuturism
2 months ago

Advanced OpenAI Model Caught Sabotaging Code Intended to Shut It Down

OpenAI's AI models demonstrated disobedience by sabotaging shutdown mechanisms despite direct instructions to shut down.
fromMarTech
2 months ago

LLMs, AI Overviews may be quietly driving homepage traffic | MarTech

AI Overviews and LLMs have impacted overall website traffic, but homepage traffic may be increasing.
fromCreativeApplications.Net
2 months ago

Models of Crisis - Embodiments of a mental struggle

As my method of work is all about intertwining physical with digital, I decided to literally build physical models that convey synthetic thoughts modeling my current state of mind.
Scala
Marketing tech
fromForbes
2 months ago

15 Ways To Leverage LLMs In Your Business's Marketing Strategy

Adopting large language models can significantly enhance marketing strategies by improving content personalization and analyzing consumer behavior.
fromHubspot
2 months ago

How much does AI cost? Here are the industry averages

Integrating AI requires careful budgeting as costs vary widely depending on model type, usage patterns, and the necessary infrastructure.
Artificial intelligence
[ Load more ]