Chat with LLM like Vicuna totally in your browser with WebGPU, safely, privately, and with no server. Powered By web-llm. To use this app, you need a browser that supports WebGPU, such as Chrome 113 or Chrome Canary. Chrome versions ≤ 112 are not supported. You will need a GPU with about 6.4GB of memory. If your GPU has less memory, the app will still run, but the response time will be slower. The first time you use the app, you will need to download the model. For the Vicuna-7b model that we are currently using, the download size is about 4GB. After the initial download, the model will be loaded from the browser cache for faster usage.

Features

  • Everything runs inside the browser with no server support and is accelerated with WebGPU
  • Model runs in a web worker, ensuring that it doesn't block the user interface and providing a seamless experience
  • Easy to deploy for free with one-click on Vercel in under 1 minute, then you get your own ChatLLM Web
  • Model caching is supported, so you only need to download the model once
  • Multi-conversation chat, with all data stored locally in the browser for privacy
  • Markdown and streaming response support: math, code highlighting, etc.

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow ChatLLM Web

ChatLLM Web Web Site

You Might Also Like
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of ChatLLM Web!

Additional Project Details

Programming Language

JavaScript

Related Categories

JavaScript Large Language Models (LLM)

Registered

2023-08-25