[go: up one dir, main page]

Showing 238 open source projects for "gpt-sovits"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Combine Jira and SCM data to improve team performance Icon
    Combine Jira and SCM data to improve team performance

    For engineering leaders who need to foster alignment with the business and streamline their operations for better efficiency and higher productivity

    Jellyfish is the leading Engineering Management Platform, providing complete visibility into engineering organizations, the work they do, and how they operate. By analyzing engineering signals from Git and Jira, qualitative team feedback, and contextual business data from roadmapping, incident response, HR, calendar, and collaboration tools, Jellyfish enables engineering leaders to align engineering decisions with business initiatives and deliver the right software, efficiently, on time. With Jellyfish, engineering leaders can focus their teams on what matters most to the business, driving strategic decisions and delivering results.
    Learn More
  • 1
    LangExtract

    LangExtract

    A Python library for extracting structured information

    ...Each extracted entity is precisely grounded in its original context, allowing visual inspection and validation via automatically generated interactive HTML visualizations. LangExtract supports a wide range of models, including Google Gemini, OpenAI GPT, and local LLMs via Ollama, making it adaptable to different deployment environments and compliance needs. The system excels at handling long documents using optimized chunking, multi-pass extraction, and parallel processing to ensure both high recall and structured consistency.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Transformer Engine

    Transformer Engine

    A library for accelerating Transformer models on NVIDIA GPUs

    ...TE also includes a framework-agnostic C++ API that can be integrated with other deep-learning libraries to enable FP8 support for Transformers. As the number of parameters in Transformer models continues to grow, training and inference for architectures such as BERT, GPT, and T5 become very memory and compute-intensive. Most deep learning frameworks train with FP32 by default. This is not essential, however, to achieve full accuracy for many deep learning models.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    node-red-contrib-custom-chatgpt
    A Node-RED node that interacts with OpenAI machine learning models like "ChatGPT". Install with the built-in Node-RED Palette manager. When editing the properties of the node, to get your OPENAI_API_KEY log in to ChatGPT. Create a new secret key" then copy and paste the "API key" into the node API_KEY property value. msg.payload should be a well-written prompt that provides enough information for the model to know what you want and how it should respond. Its success generally depends on the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Colossal-AI

    Colossal-AI

    Making large AI models cheaper, faster and more accessible

    ...This imposes challenges to the memory wall of the current accelerator hardware such as GPU. It is never ideal to train large models such as Vision Transformer, BERT, and GPT on a single GPU or a single machine. There is an urgent demand to train models in a distributed environment. However, distributed training, especially model parallelism, often requires domain expertise in computer systems and architecture. It remains a challenge for AI researchers to implement complex distributed training solutions for their models. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Optimize every aspect of hiring with Greenhouse Recruiting Icon
    Optimize every aspect of hiring with Greenhouse Recruiting

    Hire for what’s next.

    What’s next for many of us is changing. Your company’s ability to hire great talent is as important as ever – so you’ll be ready for whatever’s ahead. Whether you need to scale your team quickly or improve your hiring process, Greenhouse gives you the right technology, know-how and support to take on what’s next.
    Learn More
  • 5
    OpenAI Realtime Agents

    OpenAI Realtime Agents

    This is a simple demonstration of more advanced, agentic patterns

    ...It also supports a “Chat-Supervisor” pattern where a lightweight realtime chat agent handles user interactions and delegates more complex reasoning or tool usage to a stronger textual model (e.g. GPT-4). Because realtime agents are still a beta feature, the code and API surface are subject to changes and may evolve.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Zypher Agent

    Zypher Agent

    A minimal yet powerful framework for creating AI agents

    Zypher Agent is an open-source framework for building full-featured AI agents that can be embedded directly into applications, enabling reactive decision loops where the agent dynamically chooses its next actions. Unlike workflow-style orchestrators, it uses a reactive agent loop that interprets the task, reasons about next steps via LLMs, and integrates directly with extensible tools and external services. Zypher prioritizes native support for multiple model providers such as OpenAI and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    WavTokenizer

    WavTokenizer

    SOTA discrete acoustic codec models with 40/75 tokens per second

    WavTokenizer is a state-of-the-art discrete acoustic codec designed specifically for audio language modeling, capable of compressing 24 kHz audio into just 40 or 75 tokens per second while preserving high perceptual quality. It is built to represent speech, music, and general audio with extremely low bitrate, making it ideal as a front-end for large audio language models like GPT-4o and similar architectures. The model uses a single-quantizer design together with temporal compression to achieve extreme compression without sacrificing reconstruction fidelity. Its architecture incorporates a broader vector-quantization space, extended contextual windows, and improved attention networks, combined with multi-scale discriminators and inverse Fourier transform blocks to enhance waveform reconstruction. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Griptape

    Griptape

    Python framework for AI workflows and pipelines with chain of thought

    The Griptape framework provides developers with the ability to create AI systems that operate across two dimensions: predictability and creativity. For predictability, Griptape enforces structures like sequential pipelines, DAG-based workflows, and long-term memory. To facilitate creativity, Griptape safely prompts LLMs with tools (keeping output data off prompt by using short-term memory), which connects them to external APIs and data stores. The framework allows developers to transition...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    ChatGPT Academic

    ChatGPT Academic

    ChatGPT extension for scientific research work

    ...All buttons are dynamically generated by reading functional.py, you can add custom functions at will, and liberate the pasteboard. Support for markdown tables output by GPT. If the output contains a formula, it will be displayed in tex form and rendered form at the same time, which is convenient for copying and reading.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Monitoring, Securing, Optimizing 3rd party scripts Icon
    Monitoring, Securing, Optimizing 3rd party scripts

    For developers looking for a solution to monitor, script, and optimize 3rd party scripts

    c/side is crawling many sites to get ahead of new attacks. c/side is the only fully autonomous detection tool for assessing 3rd party scripts. We do not rely purely on threat feed intel or easy to circumvent detections. We also use historical context and AI to review the payload and behavior of scripts.
    Learn More
  • 10
    GPTel

    GPTel

    A no-frills ChatGPT client for Emacs

    ...It will ask you for the key if you skipped the previous step. Run it with a prefix-arg to start a new session. In the gptel buffer, send your prompt with M-x gptel-send, bound to C-c RET. Set chat parameters (GPT model, directives etc) for the session by calling gptel-send with a prefix argument.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    spacy-transformers

    spacy-transformers

    Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

    spaCy supports a number of transfer and multi-task learning workflows that can often help improve your pipeline’s efficiency or accuracy. Transfer learning refers to techniques such as word vector tables and language model pretraining. These techniques can be used to import knowledge from raw text into your pipeline, so that your models are able to generalize better from your annotated examples. You can convert word vectors from popular tools like FastText and Gensim, or you can load in any...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Penzai

    Penzai

    A JAX research toolkit to build, edit, & visualize neural networks

    Penzai, developed by Google DeepMind, is a JAX-based library for representing, visualizing, and manipulating neural network models as functional pytree data structures. It is designed to make machine learning research more interpretable and interactive, particularly for tasks like model surgery, ablation studies, architecture debugging, and interpretability research. Unlike conventional neural network libraries, Penzai exposes the full internal structure of models, enabling fine-grained...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Tarsier

    Tarsier

    Vision utilities for web interaction agents

    At Reworkd, we iterated on all these problems across tens of thousands of real web tasks to build a powerful perception system for web agents... Tarsier! In the video below, we use Tarsier to provide webpage perception for a minimalistic GPT-4 LangChain web agent. Tarsier visually tags interactable elements on a page via brackets + an ID e.g. [23]. In doing this, we provide a mapping between elements and IDs for an LLM to take actions upon (e.g. CLICK [23]). We define interactable elements as buttons, links, or input fields that are visible on the page; Tarsier can also tag all textual elements if you pass tag_text_elements=True. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Qwen3-Omni

    Qwen3-Omni

    Qwen3-omni is a natively end-to-end, omni-modal LLM

    ...It achieves state-of-the-art results: across 36 audio and audio-visual benchmarks, it hits open-source SOTA on 32 and overall SOTA on 22, outperforming or matching strong closed-source models such as Gemini-2.5 Pro and GPT-4o. To reduce latency, especially in audio/video streaming, Talker predicts discrete speech codecs via a multi-codebook scheme and replaces heavier diffusion approaches.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    LLaVA

    LLaVA

    Visual Instruction Tuning: Large Language-and-Vision Assistant

    Visual instruction tuning towards large language and vision models with GPT-4 level capabilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Haystack

    Haystack

    Haystack is an open source NLP framework to interact with your data

    ...Perform semantic search and retrieve ranked documents according to meaning, not just keywords! Make use of and compare the latest pre-trained transformer-based languages models like OpenAI’s GPT-3, BERT, RoBERTa, DPR, and more. Pick any Transformer model from Hugging Face's Model Hub, experiment, find the one that works. Use Haystack NLP components on top of Elasticsearch, OpenSearch, or plain SQL. Boost search performance with Pinecone, Milvus, FAISS, or Weaviate vector databases, and dense passage retrieval.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    ChatGPT Java

    ChatGPT Java

    A Java client for the ChatGPT API

    ChatGPT Java is a Java client for the ChatGPT API. Use official API with model gpt-3.5-turbo.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    VividNode

    VividNode

    Multi-purpose Text & Image Generation Desktop Chatbot

    A cross-platform AI desktop chatbot application for LLM such as GPT, Claude, Gemini, Llama chatbot interaction and image generation, offering customizable features, local chat history, and enhanced performance—no browser required!
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19

    qt-fsarchiver

    Program for Backup and restore partitions

    qt-fsarchiver is a GUI for the program fsarchiver to save/restore partitions, folders and MBR/GPT. Clone from hard disk and partitions is possible. The program is for systems based Debian, for Suse and Fedora. Look: (German language) http://wiki.ubuntuusers.de/qt-fsarchiver. Language: German, English, Spain, Russia, Chinese and Italian and more. Notes in Readme (Liesmich) for install in Suse, Fedora and Debian based systems.
    Leader badge">
    Downloads: 53 This Week
    Last Update:
    See Project
  • 20
    GParted

    GParted

    A partition editor to graphically manage disk partitions

    GNOME Partition Editor for creating, reorganizing, and deleting disk partitions. It uses libparted from the parted project to detect and manipulate partition tables. Optional file system tools permit managing file systems not included in libparted.
    Leader badge">
    Downloads: 30,920 This Week
    Last Update:
    See Project
  • 21
    This a helpful miner for GPT and HYIP
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    AIEditorApp

    AIEditorApp

    Privacy-first desktop application for LLMs and AI APIs

    🚀 Cross-platform desktop application. 💻 Works on Mac, Windows, and Linux. 🗂️ All chat data is stored locally. 🗄️ Local templates and logs. 🤖 Use any AI model of your choice.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Ventoy

    Ventoy

    A New Bootable USB Solution

    Ventoy is an open source tool to create bootable USB drive for ISO/WIM/IMG/VHD(x)/EFI files. With Ventoy, you don't need to format the disk over and over, you just need to copy the ISO/WIM/IMG/VHD(x)/EFI files to the USB drive and boot them directly. You can copy many files at a time and Ventoy will give you a boot menu to select them. You can also browse ISO/WIM/IMG/VHD(x)/EFI files in local disks and boot them. x86 Legacy BIOS, IA32 UEFI, x86_64 UEFI, ARM64 UEFI and MIPS64EL...
    Leader badge">
    Downloads: 212,159 This Week
    Last Update:
    See Project
  • 24
    Windows Install

    Windows Install

    Installing Windows from macOS. Suitable for Hackintosh and Macintosh

    The Windows Install.app program allows you to install Windows directly from the Mac OS system. There is no need to create an installation flash drive. Suitable for Hackintosh and Macintosh (install drivers yourself). Possibility to make a backup. There is a drag and drop support function. Compatible with Mac OS X 10.13 and up. The utilities used are wimlib and ntfs-3g and others.
 Disk access must be granted (shown in the screenshot) The user must be an administrator and the...
    Leader badge">
    Downloads: 998 This Week
    Last Update:
    See Project
  • 25
    Mr. Ranedeer

    Mr. Ranedeer

    GPT-4 AI Tutor Prompt for customizable personalized learning

    Unlock the potential of GPT-4 with Mr. Ranedeer AI Tutor, a customizable prompt that delivers personalized learning experiences for users with diverse needs and interests.
    Downloads: 2 This Week
    Last Update:
    See Project