Address
:
[go:
up one dir
,
main page
]
Include Form
Remove Scripts
Accept Cookies
Show Images
Show Referer
Rotate13
Base64
Strip Meta
Strip Title
Session Cookies
Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
llm
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Metí Gemma corriendo en el browser, sin API keys, y me cambió cómo pienso el edge
Juan Torchia
Juan Torchia
Juan Torchia
Follow
Apr 8
Metí Gemma corriendo en el browser, sin API keys, y me cambió cómo pienso el edge
#
react
#
nextjs
#
llm
#
webgpu
Comments
Add Comment
9 min read
I Built a RAG Pipeline. Then I Realized Retrieval Is the Real Model
jacobjerryarackal
jacobjerryarackal
jacobjerryarackal
Follow
Apr 8
I Built a RAG Pipeline. Then I Realized Retrieval Is the Real Model
#
ai
#
llm
#
machinelearning
#
rag
1
reaction
Comments
Add Comment
3 min read
Why We Ditched Bedrock Agents for Nova Pro and Built a Custom Orchestrator
Alex Vega
Alex Vega
Alex Vega
Follow
Apr 5
Why We Ditched Bedrock Agents for Nova Pro and Built a Custom Orchestrator
#
agents
#
architecture
#
aws
#
llm
Comments
Add Comment
7 min read
HBM4 Didn't Break the Memory Wall — It Just Moved It
plasmon
plasmon
plasmon
Follow
Apr 8
HBM4 Didn't Break the Memory Wall — It Just Moved It
#
semiconductor
#
llm
#
hardware
#
ai
Comments
Add Comment
6 min read
How AI Apps Actually Use LLMs: Introducing RAG
Vaishali
Vaishali
Vaishali
Follow
Apr 8
How AI Apps Actually Use LLMs: Introducing RAG
#
ai
#
llm
#
rag
#
webdev
Comments
Add Comment
4 min read
Google Gemma 4: How a 31B Model Beats 600B+ Giants (Benchmarks + NVIDIA Co-Optimization)
정상록
정상록
정상록
Follow
Apr 8
Google Gemma 4: How a 31B Model Beats 600B+ Giants (Benchmarks + NVIDIA Co-Optimization)
#
news
#
ai
#
google
#
llm
Comments
Add Comment
2 min read
LLMKube Now Deploys Any Inference Engine, Not Just llama.cpp
Christopher Maher
Christopher Maher
Christopher Maher
Follow
Apr 8
LLMKube Now Deploys Any Inference Engine, Not Just llama.cpp
#
llm
#
opensource
#
ai
#
kubernetes
Comments
Add Comment
3 min read
Anthropic Just Released a Model So Dangerous They Gave It to Only Security Researchers
Aamer Mihaysi
Aamer Mihaysi
Aamer Mihaysi
Follow
Apr 8
Anthropic Just Released a Model So Dangerous They Gave It to Only Security Researchers
#
ai
#
security
#
anthropic
#
llm
Comments
Add Comment
2 min read
I built an Ollama alternative with TurboQuant, model groups, and multi-GPU support
deharoalexandre-cyber
deharoalexandre-cyber
deharoalexandre-cyber
Follow
Apr 8
I built an Ollama alternative with TurboQuant, model groups, and multi-GPU support
#
ai
#
llm
#
opensource
#
cpp
Comments
Add Comment
4 min read
Running Just One LLM on 8GB VRAM Is a Waste
plasmon
plasmon
plasmon
Follow
Apr 7
Running Just One LLM on 8GB VRAM Is a Waste
#
llm
#
machinelearning
#
python
#
ai
Comments
Add Comment
8 min read
Light Just Cut KV Cache Memory Traffic to 1/16th
plasmon
plasmon
plasmon
Follow
Apr 7
Light Just Cut KV Cache Memory Traffic to 1/16th
#
llm
#
photonics
#
semiconductor
#
inference
Comments
Add Comment
7 min read
Why Your Agent Doesn't Know What Time It Is
Art H
Art H
Art H
Follow
Apr 7
Why Your Agent Doesn't Know What Time It Is
#
agents
#
ai
#
architecture
#
llm
Comments
Add Comment
7 min read
The AI Stack: A Practical Guide to Building Your Own Intelligent Applications
Midas126
Midas126
Midas126
Follow
Apr 8
The AI Stack: A Practical Guide to Building Your Own Intelligent Applications
#
ai
#
machinelearning
#
llm
#
development
Comments
Add Comment
5 min read
ツール呼び出しでも大きいモデルは勝てなかった
plasmon
plasmon
plasmon
Follow
Apr 7
ツール呼び出しでも大きいモデルは勝てなかった
#
llm
#
ai
#
python
Comments
Add Comment
4 min read
I benchmarked GPT-4o, Claude 3.5, and Gemini 1.5 for security — the results
NY-squared2-agents
NY-squared2-agents
NY-squared2-agents
Follow
Apr 8
I benchmarked GPT-4o, Claude 3.5, and Gemini 1.5 for security — the results
#
ai
#
security
#
llm
#
benchmark
Comments
Add Comment
2 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account