#large-language-models

[ follow ]
#artificial-intelligence
Artificial intelligence
fromITPro
1 month ago

What is synthetic data?

Synthetic data can address the data shortage crisis by providing artificial datasets that mimic real data.
Advancements in AI, particularly with large language models, are transforming how synthetic data is created.
Data science
fromHackernoon
3 months ago

LLMs in Data Engineering: Not Just Hype, Here's What's Real | HackerNoon

Large Language Models are transforming data engineering by enhancing performance and operational efficiencies.
Artificial intelligence
fromITPro
1 month ago

What is synthetic data?

Synthetic data can address the data shortage crisis by providing artificial datasets that mimic real data.
Advancements in AI, particularly with large language models, are transforming how synthetic data is created.
Data science
fromHackernoon
3 months ago

LLMs in Data Engineering: Not Just Hype, Here's What's Real | HackerNoon

Large Language Models are transforming data engineering by enhancing performance and operational efficiencies.
more#artificial-intelligence
#machine-learning
Artificial intelligence
fromMedium
2 months ago

DeepSeek R1: Hype vs. Reality-A Deeper Look at AI's Latest Disruption

DeepSeek R1's launch signals a major evolution in large language models, demonstrating unique training methods and competitive advantages over existing models.
Artificial intelligence
fromtowardsdatascience.com
2 months ago

Formulation of Feature Circuits with Sparse Autoencoders in LLM

Sparse Autoencoders can help interpret Large Language Models despite challenges posed by superposition.
Feature circuits in neural networks illustrate how input features combine to form complex patterns.
Artificial intelligence
fromtowardsdatascience.com
2 months ago

How LLMs Work: Pre-Training to Post-Training, Neural Networks, Hallucinations, and Inference

Large language models (LLMs) are built through extensive pre-training and post-training phases, focusing on understanding language through massive datasets.
Artificial intelligence
fromHackernoon
1 month ago

Rethinking AI Quantization: The Missing Piece in Model Efficiency | HackerNoon

Quantum strategies optimize LLM precision while balancing accuracy and effectiveness through methods like post-training quantization and quantization-aware training.
Online learning
fromInfoQ
1 month ago

Hugging Face Publishes Guide on Efficient LLM Training Across GPUs

Hugging Face's Ultra-Scale Playbook offers an open-source guide for efficiently training large language models on GPU clusters.
Artificial intelligence
fromMedium
2 months ago

DeepSeek R1: Hype vs. Reality-A Deeper Look at AI's Latest Disruption

DeepSeek R1's launch signals a major evolution in large language models, demonstrating unique training methods and competitive advantages over existing models.
Artificial intelligence
fromtowardsdatascience.com
2 months ago

Formulation of Feature Circuits with Sparse Autoencoders in LLM

Sparse Autoencoders can help interpret Large Language Models despite challenges posed by superposition.
Feature circuits in neural networks illustrate how input features combine to form complex patterns.
Artificial intelligence
fromtowardsdatascience.com
2 months ago

How LLMs Work: Pre-Training to Post-Training, Neural Networks, Hallucinations, and Inference

Large language models (LLMs) are built through extensive pre-training and post-training phases, focusing on understanding language through massive datasets.
Artificial intelligence
fromHackernoon
1 month ago

Rethinking AI Quantization: The Missing Piece in Model Efficiency | HackerNoon

Quantum strategies optimize LLM precision while balancing accuracy and effectiveness through methods like post-training quantization and quantization-aware training.
Online learning
fromInfoQ
1 month ago

Hugging Face Publishes Guide on Efficient LLM Training Across GPUs

Hugging Face's Ultra-Scale Playbook offers an open-source guide for efficiently training large language models on GPU clusters.
more#machine-learning
#ai
Artificial intelligence
fromTechCrunch
2 months ago

Inception emerges from stealth with a new type of AI model | TechCrunch

Inception's diffusion-based model enables faster text generation, reducing computing costs compared to traditional large language models.
Artificial intelligence
fromTechCrunch
2 months ago

Inception emerges from stealth with a new type of AI model | TechCrunch

Inception's diffusion-based model enables faster text generation, reducing computing costs compared to traditional large language models.
more#ai
#biomedical-text-mining
fromHackernoon
4 months ago
Data science

Future Perspectives in the Era of Large Language Models, and References | HackerNoon

Large language models necessitate robust evaluation benchmarks for biomedical text mining.
Future challenges should focus on multimodal data integration in biomedical research.
fromHackernoon
4 months ago
Data science

The Impact of Community Challenges on Biomedical Text Mining Research | HackerNoon

Community challenges have greatly advanced biomedical text mining by offering benchmarks and fostering collaboration.
fromHackernoon
4 months ago
Online Community Development

Limitations of Current Biomedical Text Mining Community Challenges | HackerNoon

Biomedical text mining challenges face data quality and representativeness issues, hindering innovation.
Current evaluations lack diversity in methodologies and often use inadequate datasets.
fromHackernoon
4 months ago
Data science

Future Perspectives in the Era of Large Language Models, and References | HackerNoon

Large language models necessitate robust evaluation benchmarks for biomedical text mining.
Future challenges should focus on multimodal data integration in biomedical research.
fromHackernoon
4 months ago
Data science

The Impact of Community Challenges on Biomedical Text Mining Research | HackerNoon

Community challenges have greatly advanced biomedical text mining by offering benchmarks and fostering collaboration.
fromHackernoon
4 months ago
Online Community Development

Limitations of Current Biomedical Text Mining Community Challenges | HackerNoon

Biomedical text mining challenges face data quality and representativeness issues, hindering innovation.
Current evaluations lack diversity in methodologies and often use inadequate datasets.
more#biomedical-text-mining
#natural-language-processing
Healthcare
fromMedium
1 month ago

Applying Large Language Models in Healthcare: Lessons from the Field

Precision in healthcare LLMs is a necessity to avoid life-threatening errors.
John Snow Labs sets a standard for NLP in clinical applications.
Artificial intelligence
fromHackernoon
55 years ago

The Shift from Symbolic AI to Deep Learning in Natural Language Processing | HackerNoon

Large language models (LLMs) emerge from historical NLP paradigms, blending symbolic rule-based and stochastic statistical approaches.
Artificial intelligence
fromtowardsdatascience.com
2 months ago

6 Common LLM Customization Strategies Briefly Explained

LLMs revolutionize natural language processing but often require significant customization for specific business tasks.
Customizing LLMs can be achieved through freezing model parameters or updating them with specialized datasets.
fromHackernoon
4 months ago
OMG science

AI Model Reads Thousands of Studies, Nails Battery Science Better Than Expected | HackerNoon

Darwin outperforms LLaMA and LLaMA2 in NER and RE tasks for materials science.
fromHackernoon
11 months ago
Video games

Your Next Slang Phrase Might be Created by an AI | HackerNoon

Large Language Models use advanced neural networks for effective language understanding and generation.
Healthcare
fromMedium
1 month ago

Applying Large Language Models in Healthcare: Lessons from the Field

Precision in healthcare LLMs is a necessity to avoid life-threatening errors.
John Snow Labs sets a standard for NLP in clinical applications.
Artificial intelligence
fromHackernoon
55 years ago

The Shift from Symbolic AI to Deep Learning in Natural Language Processing | HackerNoon

Large language models (LLMs) emerge from historical NLP paradigms, blending symbolic rule-based and stochastic statistical approaches.
Artificial intelligence
fromtowardsdatascience.com
2 months ago

6 Common LLM Customization Strategies Briefly Explained

LLMs revolutionize natural language processing but often require significant customization for specific business tasks.
Customizing LLMs can be achieved through freezing model parameters or updating them with specialized datasets.
fromHackernoon
4 months ago
OMG science

AI Model Reads Thousands of Studies, Nails Battery Science Better Than Expected | HackerNoon

Darwin outperforms LLaMA and LLaMA2 in NER and RE tasks for materials science.
fromHackernoon
11 months ago
Video games

Your Next Slang Phrase Might be Created by an AI | HackerNoon

Large Language Models use advanced neural networks for effective language understanding and generation.
more#natural-language-processing
fromLogRocket Blog
2 weeks ago
Artificial intelligence

6 retrieval augmented generation (RAG) techniques you should know - LogRocket Blog

RAG techniques significantly improve language models by integrating up-to-date external knowledge.
#generative-ai
Artificial intelligence
fromHackernoon
1 year ago

AI's Energy Dilemma: Can LLMs Optimize Their Own Power Consumption? | HackerNoon

Generative AI's energy consumption raises sustainability concerns, prompting the need for improvements in efficiency and self-optimization.
Artificial intelligence
fromInfoQ
2 months ago

How a Software Architect Uses Artificial Intelligence in His Daily Work

Generative AI and LLMs enhance software architecture, but human architects who understand their limitations will be crucial in the future.
Artificial intelligence
fromZDNET
2 months ago

How we test AI at ZDNET in 2025

AI has become ubiquitous across devices and industries since the launch of ChatGPT in 2022.
In-depth evaluations of AI products are vital due to the nascent state of large language models.
Marketing tech
fromTheregister
4 weeks ago

LLM providers on the cusp of an 'extinction' phase

The large language model market is nearing extinction due to capital-intensive costs and lack of sustainable competition.
Artificial intelligence
fromHackernoon
1 year ago

Prompt Injection Is What Happens When AI Trusts Too Easily | HackerNoon

Generative AI is becoming essential in daily life, but it poses significant security threats like prompt injection, which can manipulate AI systems.
fromabovethelaw.com
1 month ago
Artificial intelligence

SXSW's Panel Educates Even When It Misses The Mark

A successful presentation must connect with the audience's needs, not just present technical information.
Artificial intelligence
fromHackernoon
1 year ago

AI's Energy Dilemma: Can LLMs Optimize Their Own Power Consumption? | HackerNoon

Generative AI's energy consumption raises sustainability concerns, prompting the need for improvements in efficiency and self-optimization.
Artificial intelligence
fromInfoQ
2 months ago

How a Software Architect Uses Artificial Intelligence in His Daily Work

Generative AI and LLMs enhance software architecture, but human architects who understand their limitations will be crucial in the future.
Artificial intelligence
fromZDNET
2 months ago

How we test AI at ZDNET in 2025

AI has become ubiquitous across devices and industries since the launch of ChatGPT in 2022.
In-depth evaluations of AI products are vital due to the nascent state of large language models.
Marketing tech
fromTheregister
4 weeks ago

LLM providers on the cusp of an 'extinction' phase

The large language model market is nearing extinction due to capital-intensive costs and lack of sustainable competition.
Artificial intelligence
fromHackernoon
1 year ago

Prompt Injection Is What Happens When AI Trusts Too Easily | HackerNoon

Generative AI is becoming essential in daily life, but it poses significant security threats like prompt injection, which can manipulate AI systems.
fromabovethelaw.com
1 month ago
Artificial intelligence

SXSW's Panel Educates Even When It Misses The Mark

A successful presentation must connect with the audience's needs, not just present technical information.
more#generative-ai
#ai-security
Growth hacking
fromArs Technica
1 month ago

Gemini hackers can deliver more potent attacks with a helping hand from... Gemini

Indirect prompt injections are an effective method for exploiting large language models, revealing vulnerabilities in AI systems.
fromApp Developer Magazine
2 weeks ago
Artificial intelligence

Kong AI Gateway latest version released | App Developer Magazine

Kong AI Gateway 3.10 enhances AI security and governance for GenAI with new automated features.
The update aims to reduce Large Language Model hallucinations and protect personal data.
Growth hacking
fromArs Technica
1 month ago

Gemini hackers can deliver more potent attacks with a helping hand from... Gemini

Indirect prompt injections are an effective method for exploiting large language models, revealing vulnerabilities in AI systems.
fromApp Developer Magazine
2 weeks ago
Artificial intelligence

Kong AI Gateway latest version released | App Developer Magazine

Kong AI Gateway 3.10 enhances AI security and governance for GenAI with new automated features.
The update aims to reduce Large Language Model hallucinations and protect personal data.
more#ai-security
#ai-development
fromITProUK
2 weeks ago
Scala

Redis unveils new tools for developers working on AI applications

Redis introduces tools for AI developers to improve application performance.
LangCache optimizes large language model interactions, enhancing speed and accuracy.
Vector sets offer a new way to manage and scale data for AI applications.
fromITProUK
2 weeks ago
Scala

Redis unveils new tools for developers working on AI applications

Redis introduces tools for AI developers to improve application performance.
LangCache optimizes large language model interactions, enhancing speed and accuracy.
Vector sets offer a new way to manage and scale data for AI applications.
more#ai-development
fromHackernoon
5 months ago
Artificial intelligence

Tired of Digging Through Long PDFs? You Can Build a Bot That Can Quickly Answer Questions for You | HackerNoon

Large language models struggle to provide accurate answers when overwhelmed by large amounts of information.
Retrieval-Augmented Generation (RAG) enhances LLMs by allowing them to focus on relevant information for better accuracy.
Online Community Development
fromHackernoon
11 months ago

Speaking in Code: How AI Simulates Language Evolution on Regulated Social Media | HackerNoon

Users on regulated social media adapt communication through coded language, showcasing language evolution under societal pressures.
Artificial intelligence
fromHackernoon
3 months ago

Alibaba's Claude Killer Enters the Ring | HackerNoon

Alibaba has launched QVQ-Max, a sophisticated visual reasoning model that integrates visual understanding with enhanced problem-solving capabilities.
Online learning
fromHackernoon
2 months ago

102 Languages, One Model: The Multimodal AI Breakthrough You Need to Know | HackerNoon

The new multi-modal retrieval system uses large language models to connect speech and text across 102 languages without needing paired data during pre-training.
fromHackernoon
3 weeks ago
Scala

Efficient On-Device LLMs: Function Calling and Fine-Tuning Strategies | HackerNoon

The deployment of smaller-scale Large Language Models (LLMs) on edge devices is progressing despite challenges.
7B and 13B models have shown significant capabilities in function calling, rivaling GPT-4.
Software development
fromHackernoon
4 weeks ago

GPTutor Lets Developers Fine-Tune AI Coding Help Inside VS Code | HackerNoon

GPTutor allows users to customize prompts for improved software development efficiency as an alternative to conventional AI tools.
Artificial intelligence
fromInfoQ
3 weeks ago

Beyond Chatbots: Architecting Domain-Specific Generative AI for Operational Decision-Making

LLMs excel at text generation but fall short in understanding business operations and making domain-specific decisions.
Domain-specific models can learn operational constraints and support structured decision-making, offering greater efficiency.
fromHackernoon
1 month ago
Artificial intelligence

Think-and-Execute: The Experimental Details | HackerNoon

The study uses various large language models (LLMs) for experimental tasks, emphasizing differences in performance and inference times.
Artificial intelligence
fromInfoQ
1 month ago

Dapr Agents: Scalable AI Workflows with LLMs, Kubernetes & Multi-Agent Coordination

Dapr Agents framework enables scalable and resilient AI agents using LLMs, enhancing reliability and multi-agent coordination.
#reinforcement-learning
fromTheregister
1 month ago
Artificial intelligence

El Reg digs its claws into Alibaba's QwQ

Reinforcement learning can significantly improve the performance of smaller language models like QwQ.
QwQ is designed to outperform larger models in specific benchmarks despite its smaller size.
fromTheregister
1 month ago
Artificial intelligence

El Reg digs its claws into Alibaba's QwQ

Reinforcement learning can significantly improve the performance of smaller language models like QwQ.
QwQ is designed to outperform larger models in specific benchmarks despite its smaller size.
more#reinforcement-learning
Marketing tech
fromForbes
1 month ago

Adapt Or Fade: Crafting A New SEO Playbook For The Era Of LLMs

SEO is evolving; expertise and trustworthiness in content are essential for relevance.
Large language models are changing how users search for information, potentially overshadowing traditional search engines.
#data-privacy
Artificial intelligence
fromThe Hacker News
2 months ago

12,000+ API Keys and Passwords Found in Public Datasets Used for LLM Training

Hard-coded credentials in datasets pose severe security risks for users and organizations.
Large language models may amplify insecure coding practices due to the presence of live secrets in training data.
DevOps
fromInfoQ
1 month ago

GitLab Launches Support for Self-Hosted AI Platforms

GitLab 17.9 enhances user experience by introducing self-hosted LLM capabilities for improved data control and compliance.
Artificial intelligence
fromThe Hacker News
2 months ago

12,000+ API Keys and Passwords Found in Public Datasets Used for LLM Training

Hard-coded credentials in datasets pose severe security risks for users and organizations.
Large language models may amplify insecure coding practices due to the presence of live secrets in training data.
DevOps
fromInfoQ
1 month ago

GitLab Launches Support for Self-Hosted AI Platforms

GitLab 17.9 enhances user experience by introducing self-hosted LLM capabilities for improved data control and compliance.
more#data-privacy
#quantization
Scala
fromHackernoon
1 month ago

The Future of AI Compression: Smarter Quantization Strategies | HackerNoon

Impact-based parameter selection outperforms magnitude-based criteria in improving quantization for language models.
fromHackernoon
1 month ago
Scala

The Hidden Power of "Cherry" Parameters in Large Language Models | HackerNoon

Parameter heterogeneity in LLMs shows that a small number of parameters greatly influence performance, leading to the development of the CherryQ quantization method.
Scala
fromHackernoon
1 month ago

The Future of AI Compression: Smarter Quantization Strategies | HackerNoon

Impact-based parameter selection outperforms magnitude-based criteria in improving quantization for language models.
fromHackernoon
1 month ago
Scala

The Hidden Power of "Cherry" Parameters in Large Language Models | HackerNoon

Parameter heterogeneity in LLMs shows that a small number of parameters greatly influence performance, leading to the development of the CherryQ quantization method.
more#quantization
Privacy technologies
fromHackernoon
1 year ago

How Large Language Models Impact Data Security in RAG Applications | HackerNoon

Data security is crucial when utilizing Large Language Models in enterprises due to privacy concerns and varying provider practices.
Artificial intelligence
fromInfoWorld
2 months ago

What is retrieval-augmented generation? More accurate and reliable LLMs

RAG enhances the accuracy of large language models by integrating external data sources, but it isn't a comprehensive solution.
Artificial intelligence
fromTechzine Global
2 months ago

IBM introduces new Granite models with optional reasoning capabilities

IBM's Granite AI models enhance enterprise AI by offering efficient reasoning capabilities and innovative computational techniques.
The Granite 3.2 model is particularly suited for developing AI assistants with its instruction-following design.
[ Load more ]