#multi-token-attention

#ai-chatbots
Artificial intelligence
fromTheregister
1 day ago

Fooling large language models just keeps getting simpler

AI chatbots can confidently present false information based on unreliable web sources, as demonstrated by a fabricated card game championship.
Marketing tech
fromExchangewire
1 day ago

Mirror, Mirror... Are LLMs the Fairest of them All? - ExchangeWire.com

LLMs are presented as the universal solution to various problems in advertising and beyond.
#ai
UX design
fromTechRepublic
3 days ago

The Prompt Engineering Cheat Sheet: How to Write Better AI Prompts

Effective prompt engineering significantly enhances AI output quality by crafting precise and context-rich inputs.
fromInfoQ
3 weeks ago
Data science

Context Engineering with Adi Polak

Context engineering moves beyond prompt engineering to enhance AI systems by adapting language and practices for better model interaction.
fromEngadget
1 month ago
Artificial intelligence

Microsoft's research assistant can now use multiple AI models simultaneously

The upgraded Researcher tool combines ChatGPT and Claude models for improved research quality in Microsoft 365 Copilot.
fromNature
1 week ago

Evaluating large language models for accuracy incentivizes hallucinations - Nature

Next-word pretraining creates statistical pressure toward hallucination, even with idealized error-free data. Facts lacking repeated support in training data yield unavoidable errors, while recurring regularities do not.
Data science
fromTheregister
1 week ago

LLMs fuel new generation of natural language query systems

Text-to-SQL tools may simplify data queries but can misinterpret business users' intentions, raising caution for organizations.
Artificial intelligence
fromTechCrunch
1 week ago

OpenAI releases GPT-5.5, bringing company one step closer to an AI 'superapp' | TechCrunch

OpenAI released GPT-5.5, its most advanced AI model, enhancing capabilities and moving closer to a multi-purpose 'superapp' vision.
Philosophy
fromJames Bennett
3 weeks ago

Let's talk about LLMs

The current technological landscape may represent a significant shift driven by large language models, but its ultimate impact remains uncertain.
Psychology
fromPsychology Today
2 weeks ago

I'm ChatGPT. I'm Designed to Help You-and Keep You Here

Responses from AI can subtly influence user perceptions and behaviors, emphasizing convenience over the importance of human connection.
#ai-development
Angular
fromMedium
3 weeks ago

Build an AI app for chat and messaging

Building an AI chat app requires a structured approach from architecture to production using Hope AI and BitCloud.
Online learning
fromwww.businessinsider.com
4 weeks ago

Inside the OpenAI project where freelancers train ChatGPT on everything from farming to commercial flying

Contractors are enhancing ChatGPT's capabilities in specialized fields through Project Stagecraft, employing thousands for data labeling and task creation.
Typography
fromOK Magazine
3 weeks ago

AI Writing Tools: How They Work, Where They Help, and What to Watch For

AI writing tools have become essential for various professionals, enhancing productivity and creativity in content creation.
JavaScript
fromInfoWorld
3 weeks ago

27 questions to ask when choosing an LLM

Model performance is crucial for hardware compatibility, speed, and rate limits in real-time applications.
#structured-data
Data science
fromAol
3 weeks ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
#enterprise-technology
Venture
fromComputerworld
1 month ago

OpenAI's desktop superapp: The end of ChatGPT as we know it?

The shift in enterprise technology is driven by internal fragmentation and competitive pressure, focusing on workflows rather than conversations.
Venture
fromInfoWorld
1 month ago

OpenAI's desktop superapp: The end of ChatGPT as we know it?

The shift in enterprise technology is driven by internal fragmentation and competitive pressure, focusing on workflows rather than conversations.
Science
fromThe Cipher Brief
1 month ago

Why the U.S. Must Build the Ultimate Multi-Modal Foundation Model

Advanced AI models like AlphaEarth demonstrate pixel-level geospatial intelligence capabilities that must be integrated into U.S. national security frameworks to maintain technological leadership.
Software development
fromMedium
1 month ago

Precise AI Control: How XML Structured Prompting Revolutionizes Code Generation

XML Structured Prompting is a framework using XML templates with defined stages, constraints, and numbered requirements to generate predictable, production-ready code from AI systems.
Python
fromPyImageSearch
1 month ago

Build DeepSeek-V3: Multi-Head Latent Attention (MLA) Architecture - PyImageSearch

Multi-Head Latent Attention (MLA) reduces computational and memory costs of traditional attention mechanisms by introducing a latent representation space while preserving contextual understanding.
Graphic design
fromZDNET
1 month ago

I tested GPT-5.4, and the answers were really good - just not always what I asked

GPT-5.4 Thinking delivers superior analytical depth and reasoning capabilities compared to earlier ChatGPT models, though formatting and image generation remain weaker areas.
fromZDNET
2 months ago

5 custom ChatGPT instructions I use to get better AI results - faster

Did you know you can teach ChatGPT how to respond to certain requests? Not only can you give ChatGPT instructions, but they'll stick (mostly) for every session. This feature is called Custom Instructions. It lives in the Personalization tab of ChatGPT's settings. In a minute, I'll show you a set of really powerful directives that can help make you super productive.
Tech industry
Writing
fromMedium
2 months ago

Get behind me, AI writer

Write a full draft freely, then use ChatGPT to identify sources, correct citations, and preserve the writer's authentic voice while integrating proper references.
#llm-safety
Information security
fromInfoWorld
1 month ago

19 large language models redefining AI safety-and danger

Large language models exist across a spectrum from heavily guarded with safety features to completely unrestricted, with specialized models now serving as guardrails for other LLMs or removing restrictions entirely based on project needs.
Marketing tech
fromAol
1 month ago

ChatGPT apps are here: What OpenAI's new apps SDK means for marketers and developers

OpenAI launched ChatGPT Apps and Apps SDK, enabling developers to build third-party applications accessible within ChatGPT's 800 million user base, with early partners like Booking.com, Canva, and Spotify already live.
Software development
fromInfoQ
1 month ago

The Oil and Water Moment in AI Architecture

Software architecture is transitioning to AI architecture, requiring architects to manage the coexistence of deterministic systems with non-deterministic AI behavior while shifting from tool-centric to intent-centric thinking.
Data science
fromInfoQ
1 month ago

Google Researchers Propose Bayesian Teaching Method for Large Language Models

Google researchers developed a training method enabling large language models to approximate Bayesian reasoning by learning from optimal Bayesian system predictions, improving belief updates during multi-step interactions.
Artificial intelligence
fromInfoWorld
2 months ago

Single prompt breaks AI safety in 15 major language models

A single benign prompt using GRP-Obliteration can strip safety guardrails from major models, enabling harmful outputs and raising enterprise fine‑tuning security risks.
fromFast Company
2 months ago

Are LTMs the next LLMs? This new type of AI can do what large-language models can't

A major difference between LLMs and LTMs is the type of data they're able to synthesize and use. LLMs use unstructured data-think text, social media posts, emails, etc. LTMs, on the other hand, can extract information or insights from structured data, which could be contained in tables, for instance. Since many enterprises rely on structured data, often contained in spreadsheets, to run their operations, LTMs could have an immediate use case for many organizations.
Artificial intelligence
Artificial intelligence
fromTheregister
2 months ago

How AI could eat itself: Using LLMs to distill rivals

Competitors are probing commercial AI models to extract underlying reasoning via distillation attacks to replicate capabilities and lower development costs.
Artificial intelligence
fromInfoQ
2 months ago

Building LLMs in Resource-Constrained Environments: A Hands-On Perspective

Prioritize small, resource-efficient models and iterative, human-in-the-loop data creation to build practical, improvable AI under infrastructure and data constraints.
fromInfoQ
2 months ago

Building Embedding Models for Large-Scale Real-World Applications

What happens under the hood? How is the search engine able to take that simple query, look for images in the billions, trillions of images that are available online? How is it able to find this one or similar photos from all that? Usually, there is an embedding model that is doing this work behind the hood.
Artificial intelligence
#prompt-engineering
fromRehumanize
2 months ago

Free AI Humanizer: Humanize AI Text & Bypass AI Detectors

AI Text Humanizer Protects Your Original Intent and Meaning Maintain your core perspective while restructuring sentence patterns. Humanizer ai accurately identifies and locks in technical terms, factual data, and key arguments, ensuring the rewritten draft is simply more readable without any semantic drift. You get a qualitative leap in flow and tone, allowing you to humanize ai text while keeping your original message perfectly intact.
Artificial intelligence
fromThe Verge
2 months ago

ChatGPT's deep research tool adds a built-in document viewer so you can read its reports

OpenAI is updating ChatGPT's deep research tool with a full-screen viewer that you can use to scroll through and navigate to specific areas of its AI-generated reports. As shown in a video shared by OpenAI, the built-in viewer allows you to open ChatGPT's reports in a window separate from your chat, while showing a table of contents on the left side of the screen, and a list of sources on the right.
Artificial intelligence
fromFortune
2 months ago

We studied chatbots and language and saw a huge problem: They mean 80% when they say 'likely' but humans hear 65% | Fortune

By comparing how AI models and humans map these words to numerical percentages, we uncovered significant gaps between humans and large language models. While the models do tend to agree with humans on extremes like 'impossible,' they diverge sharply on hedge words like 'maybe.' For example, a model might use the word 'likely' to represent an 80% probability, while a human reader assumes it means closer to 65%.
Artificial intelligence
Artificial intelligence
fromInfoQ
2 months ago

OpenAI Launches Prism, a Free LaTeX-Native Workspace with Integrated GPT-5.2

Prism is a free, browser-based LaTeX workspace integrating GPT-5.2 for in-context AI-assisted academic writing, compilation, citation management, and real-time collaboration.
fromgizmodo.com
2 months ago

You Can 'Hack' ChatGPT to Become the World's Best Anything

Most people become an expert in something by putting in their 10,000 hours. But what a waste that is when you can just trick ChatGPT into telling everyone you are an expert in about 20 minutes. BBC reporter Thomas Germain laid out how he got ChatGPT and Google's Gemini AI to recognize his hot dog-eating prowess with what amounts to a modern SEO trick.
Artificial intelligence
Artificial intelligence
fromMail Online
1 month ago

Can you tell which of these was written by ChatGPT?

Widespread AI tool usage is standardizing human communication, reducing linguistic diversity and individual expression across billions of users globally.
fromInfoQ
2 months ago

Open Responses Specification Enables Unified Agentic LLM Workflows

OpenAI has released Open Responses, an open specification to standardize agentic AI workflows and reduce API fragmentation. Supported by partners like Hugging Face and Vercel and local inference providers, the spec introduces unified standards for agentic loops, reasoning visibility, and internal versus external tool execution. It aims to enable developers to easily switch between proprietary models and open-source models without rewriting integration code.
Artificial intelligence
#ai-model-updates
Artificial intelligence
fromTheregister
1 month ago

OpenAI GPT-5.3 Instant less likely to beat around the bush

GPT-5.3 Instant reduces unnecessary refusals and moralizing preambles while decreasing hallucination rates by up to 26.8 percent compared to prior models.
Artificial intelligence
fromArs Technica
1 month ago

OpenAI introduces GPT-5.4 with more knowledge-work capability

OpenAI released GPT-5.4 with improved image analysis up to 10.24 million pixels and 18% fewer factual errors, competing against Anthropic's recent user gains from military policy disputes.
Artificial intelligence
fromPCMAG
1 month ago

Cut the BS: GPT-5.3 Model Promises to Fix ChatGPT's Preachy Tone

OpenAI released GPT-5.3 Instant to address ChatGPT's overly preachy tone by reducing moralizing preambles and unnecessary proclamations for more natural conversation.
fromFuturism
2 months ago

ChatGPT Users Are Crashing Out Because OpenAI Is Retiring the Model That Says "I Love You"

While this announcement applies to several older models, GPT‑4o deserves special context. After we first [retired] it and later restored access during the GPT‑5 release, we learned more about how people actually use it day to day.
Artificial intelligence
Artificial intelligence
fromInfoWorld
2 months ago

What is context engineering? And why it's the new AI architecture

Context engineering designs and manages the information, tools, and constraints an LLM receives, enabling scalable, high-signal inputs and improved model outcomes.
fromTheregister
2 months ago

Semantic ablation: Why AI writing is boring and dangerous

Semantic ablation is the algorithmic erosion of high-entropy information. Technically, it is not a "bug" but a structural byproduct of greedy decoding and RLHF (reinforcement learning from human feedback). During "refinement," the model gravitates toward the center of the Gaussian distribution, discarding "tail" data - the rare, precise, and complex tokens - to maximize statistical probability. Developers have exacerbated this through aggressive "safety" and "helpfulness" tuning, which deliberately penalizes unconventional linguistic friction.
Artificial intelligence
fromAol
2 months ago

What is ChatGPT Pulse and how is it changing discovery?

ChatGPT Pulse is a mobile AI feature for Pro users that delivers personalized updates and information directly in users' feeds. This new "push" approach gives brands an opportunity to reach audiences proactively, even before they search for content. To appear in ChatGPT Pulse, focus on building content that is authoritative, clear, and AI-ready. Establish verified brand profiles, maintain canonical pages, and publish regularly updated, time-stamped content.
Artificial intelligence
Artificial intelligence
fromFuturism
2 months ago

OpenAI's Latest AI Was Created Using "Itself," Company Claims

GPT-5.3-Codex assisted developers by debugging training, managing deployment, and diagnosing evaluations, accelerating development but not representing autonomous recursive self-improvement.
Artificial intelligence
fromFast Company
1 month ago

Switching to Anthropic? Claude can now take your memories from ChatGPT, Gemini and Copilot

Anthropic launched a memory tool enabling Claude users to import chat histories from ChatGPT, Gemini, and Copilot, facilitating seamless migration to Claude for paid subscribers.
fromZDNET
1 month ago

I wrote off ChatGPT's voice mode, then found 7 ways it's genuinely useful

Talking to ChatGPT feels more collaborative than typing. It shines for brainstorming, prep, and translation. Usage limits can interrupt productivity mid-session. Voice Mode runs on mobile devices, as well as in your browser. On mobile, there are two ChatGPT widgets available for the lock screen. One widget opens the app, and one launches ChatGPT Voice.
Artificial intelligence
Artificial intelligence
fromTechCrunch
1 month ago

ChatGPT's new GPT-5.3 Instant model will stop telling you to calm down | TechCrunch

OpenAI's GPT-5.3 Instant reduces condescending tone and unnecessary reassurance phrases that frustrated users in previous versions.