Chat with LLMs like Vicuna entirely in your browser with WebGPU: safely, privately, and with no server. Powered by web-llm.

To use this app, you need a browser that supports WebGPU, such as Chrome 113 or Chrome Canary; Chrome 112 and earlier are not supported. You will also need a GPU with about 6.4 GB of memory. If your GPU has less memory, the app will still run, but responses will be slower.

The first time you use the app, you will need to download the model. For the Vicuna-7b model currently in use, the download is about 4 GB. After the initial download, the model is loaded from the browser cache, so later sessions start faster.
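To see whether a browser meets the WebGPU requirement before loading the model, a page can probe `navigator.gpu`. Below is a minimal sketch using only the standard WebGPU API; it is not code from this project, and the warning messages are illustrative:

```ts
// Probe for WebGPU support before attempting to download/load the model.
async function checkWebGPU(): Promise<boolean> {
  if (!("gpu" in navigator)) {
    console.warn("WebGPU not available; use Chrome 113+ or Chrome Canary.");
    return false;
  }
  // requestAdapter() resolves to null when no suitable GPU is available.
  const adapter = await navigator.gpu.requestAdapter();
  if (!adapter) {
    console.warn("No WebGPU adapter found on this device.");
    return false;
  }
  return true;
}
```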
Features
- Everything runs inside the browser with no server support and is accelerated with WebGPU
- Model runs in a web worker, so inference never blocks the user interface (see the sketch after this list)
- One-click free deployment on Vercel in under 1 minute to get your own ChatLLM Web
- Model caching is supported, so you only need to download the model once
- Multi-conversation chat, with all data stored locally in the browser for privacy
- Markdown and streaming response support: math, code highlighting, etc.
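To illustrate the web-worker point above, here is a minimal sketch of the pattern: inference runs off the main thread and streams partial responses back to the page via `postMessage`. The `generateReply` function is a hypothetical stand-in for the actual web-llm inference call, and the `#reply` element is an assumed placeholder:

```ts
// worker.ts -- inference runs off the main thread so the UI stays responsive.
// `generateReply` is a hypothetical stand-in for the real web-llm call.
declare function generateReply(
  prompt: string,
  onToken: (partial: string) => void
): Promise<string>;

self.onmessage = async (e: MessageEvent<{ prompt: string }>) => {
  // Stream partial output back to the page as tokens are produced.
  const full = await generateReply(e.data.prompt, (partial) =>
    postMessage({ type: "stream", text: partial })
  );
  postMessage({ type: "done", text: full });
};

// main.ts -- the UI thread only forwards prompts and renders updates.
const worker = new Worker(new URL("./worker.ts", import.meta.url), {
  type: "module",
});
worker.onmessage = (e: MessageEvent<{ type: string; text: string }>) => {
  // Render each streamed chunk as it arrives; `#reply` is an assumed element.
  document.querySelector("#reply")!.textContent = e.data.text;
};
worker.postMessage({ prompt: "Hello, Vicuna!" });
```

Because the heavy computation lives in the worker, the page thread stays free to handle typing, scrolling, and rendering of the streamed Markdown.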
Categories
Large Language Models (LLM)
License
MIT License