[go: up one dir, main page]

Search Results for "audio and video stream" - Page 4

Showing 2457 open source projects for "audio and video stream"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Create custom docs, forms, apps, e-signatures, and surveys with Titan. Icon
    Create custom docs, forms, apps, e-signatures, and surveys with Titan.

    Powerful no-code digital experiences for Salesforce

    Create custom docs, forms, apps, e-signatures, and surveys with Titan’s full-suite of enterprise applications designed to integrate seamlessly with Salesforce data across your entire organization. #1 on the Salesforce appexchange
    Learn More
  • 1
    jfMedia Player

    jfMedia Player

    Media Player

    Media Player. Plays video and audio files. Supports RTSP network streams (auto reconnecting). Includes CLI to play file or network stream.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Navidrome

    Navidrome

    Your Personal Streaming Service

    Navidrome is an open-source, web-based personal music server that lets you stream and manage your entire music collection from any browser or compatible mobile app, effectively turning your own files into a cloud-accessible music service. It supports large libraries and handles a wide variety of audio formats while maintaining very low resource usage, so it runs well even on small servers, Raspberry Pi devices, and other constrained hardware.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 3
    iptv

    iptv

    Collection of publicly available IPTV channels from all over the world

    iptv-org/iptv is a community-maintained index of publicly available IPTV channels organized into a giant, continuously updated set of M3U playlists. Instead of hosting streams, the project curates links already available on the public internet and classifies them by country, language, category, and channel metadata. The repository’s structure makes it easy to consume at different granularities: you can grab a global list, a country-specific list, or a themed list such as news, sports, or...
    Downloads: 95 This Week
    Last Update:
    See Project
  • 4

    Subtitle-Workshop-Classic-v6.3.4

    Subtitle Editor derived from 6.0c, but with VLC and Hunspell checker

    Audio waveform, VLC Video Renderer, UTF8 coding, Audio stream detection and Selection, Resizeable screens, Hunspell spellcheck, Easy shortcut editing, user profiles and more than 70 filetypes supported.
    Leader badge">
    Downloads: 95 This Week
    Last Update:
    See Project
  • Create and manage the email signature you need Icon
    Create and manage the email signature you need

    For companies and organizations that need an email signature solution

    With WiseStamp it’s easy to unify your brand and turn your emails into a powerful marketing tool. Get the most out of your emails with a professionally designed custom email signature.
    Learn More
  • 5
    Kaset

    Kaset

    The missing YouTube Music macOS app

    Kaset is a social audio platform framework that allows users to host, share, and interact with audio content in community-oriented spaces, combining elements of podcasting, voice rooms, and feedback-driven discovery. It provides an interface where creators can upload episodes, host live or scheduled voice sessions, and cultivate listener communities through comments, reactions, and follow systems. The platform emphasizes audio discovery with playlists, curated channels, and trending audio...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    Nextcloud Talk

    Nextcloud Talk

    Video- & audio-conferencing app for Nextcloud

    Nextcloud Talk is the official chat, video and audio conferencing app for Nextcloud that allows users to chat, call and screenshare with multiple other users. Nextcloud offers better protection for your communication as it provides end-to-end encryption and keeps even metadata from leaking. You can have private, group, public or password protected calls by simply inviting one person, a whole group, or sending a public link as an invitation to a call.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 7
    Next Player

    Next Player

    An Android native video player

    Next Player is an Android native video player written in Kotlin. It provides a simple and easy-to-use interface for users to play videos on their Android devices.
    Downloads: 44 This Week
    Last Update:
    See Project
  • 8
    Video Download++

    Video Download++

    Free video downloader for YouTube and hundreds of other websites

    Video Download++ is a Windows app that lets you download videos from various websites, including YouTube, Tiktok, Vimeo, Bilibili and various media channels. Just copy a video URL, paste it in Vividl and download the video in different qualities, convert it to other formats or to an audio track. You can also manage the parallel download of multiple videos and retrieve additonal meta information using the simple UI.
    Downloads: 25 This Week
    Last Update:
    See Project
  • 9
    Yattee

    Yattee

    Privacy oriented video player for iOS, tvOS and macOS

    Privacy-oriented video player for iOS, tvOS and macOS. Native user interface built with SwiftUI with customization settings. Player queue and history. Player component with custom controls, gestures and support for 4K playback. Fullscreen, Picture in Picture and background audio playback. SponsorBlock, configurable categories to skip.
    Downloads: 51 This Week
    Last Update:
    See Project
  • Composable, Open Source Payments Platform Icon
    Composable, Open Source Payments Platform

    Build or enhance your payments stack, while maintaining control with an open-source, full-stack and modular infrastructure.

    Juspay's Payments Orchestration Platform offers a comprehensive product suite for businesses, including open-source payment orchestration, global payouts, seamless authentication, payment tokenization, fraud & risk management, end-to-end reconciliation, unified payment analytics & more. The company’s offerings also include end-to-end white label payment gateway solutions & real-time payments infrastructure for banks. These solutions help businesses achieve superior conversion rates, reduce fraud, optimize costs, and deliver seamless customer experiences at scale.
    Learn More
  • 10
    RtspSimpleServer

    RtspSimpleServer

    ready-to-use RTSP / RTMP / LL-HLS / WebRTC server and proxy

    rtsp-simple-server is a ready-to-use and zero-dependency server and proxy that allows users to publish, read and proxy live video and audio streams. Publish live streams to the server Read live streams from the server. Proxy streams from other servers or cameras, always or on-demand. Streams are automatically converted from a protocol to another. For instance, it's possible to publish a stream with RTSP and read it with HLS. Serve multiple streams at once in separate paths Authenticate users; use internal or external authentication. ...
    Downloads: 112 This Week
    Last Update:
    See Project
  • 11
    Qwen3-Omni

    Qwen3-Omni

    Qwen3-omni is a natively end-to-end, omni-modal LLM

    ...It achieves state-of-the-art results: across 36 audio and audio-visual benchmarks, it hits open-source SOTA on 32 and overall SOTA on 22, outperforming or matching strong closed-source models such as Gemini-2.5 Pro and GPT-4o. To reduce latency, especially in audio/video streaming, Talker predicts discrete speech codecs via a multi-codebook scheme and replaces heavier diffusion approaches.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    Membrane Core

    Membrane Core

    The core of Membrane Framework, multimedia processing framework

    membrane_core is the foundation of the Membrane multimedia framework for Elixir, providing the abstractions and runtime needed to build real-time audio and video pipelines. It models media processing as a graph of lightweight, supervised OTP processes—elements connected by links—so work is isolated, fault-tolerant, and easy to scale or reconfigure at runtime. The core defines a clear lifecycle and callback API for elements, plus concepts like buffers, events, and capabilities/format negotiation to keep components interoperable and type-safe. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    Recorder

    Recorder

    HTML5 js recording mp3 wav ogg webm amr format

    ​Supports microphone recording and real-time processing in most of the implemented getUserMediamobile and PC browsers, mainly including Chrome, Firefox, Safari, iOS 14.3+, Android WebView, Tencent Android X5 kernel (QQ, WeChat, Mini Program WebView) , uni-app (App, H5), and most Android phones updated after 2021 have their own browsers; do not support: UC-based kernel (typical Alipay), most of the old domestic mobile phones that have not been updated have their own browsers and any other form of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). Provides multiple plug-in function support. Rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc.; with powerful real-time processing support, it can be used in various web applications: from simple recording to complex real-time voice Recognition (ASR), and even audio-related games, are handled with ease.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    Amazon Chime SDK for Android

    Amazon Chime SDK for Android

    An Android client library for integrating multi-party communications

    The Amazon Chime SDK for Android makes it easy to add collaborative audio calling, video calling, and screen share viewing features to Android applications by using the same infrastructure services that power meetings on the Amazon Chime service. This Amazon Chime SDK for Android works by connecting to meeting session resources that you have created in your AWS account. The SDK has everything you need to build custom calling and collaboration experiences in your Android application, including methods to: configure meeting sessions, list and select audio devices, switch video devices, start and stop screen share viewing, and receive callbacks when media events occur such as volume changes, and manage meeting features such as audio mute and video tile bindings. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Amazon Chime SDK for JavaScript

    Amazon Chime SDK for JavaScript

    A JavaScript client library for integrating multi-party communications

    The Amazon Chime SDK is a set of real-time communications components that developers can use to quickly add messaging, audio, video, and screen sharing capabilities to their web or mobile applications. Developers can build on AWS's global communications infrastructure to deliver engaging experiences in their applications. For example, they can add video to a health application so patients can consult remotely with doctors on health issues, or create customized audio prompts for integration with the public telephone network. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    SALMONN family

    SALMONN family

    A suite of advanced multi-modal LLMs

    SALMONN is a family of advanced multi-modal large language models (LLMs) developed by ByteDance — designed to handle and integrate multiple data modalities (e.g. text, audio, video) rather than just plain text. The repository bundles different branches targeting specialized tasks (e.g. video-SALMONN, speech-quality assessment, general multimodal tasks), suggesting that the project is modular and extensible across domains. SALMONN aims to push the frontier of multi-modal AI by allowing models to process and reason over diverse inputs, which can be useful for applications such as video understanding, speech analytics, cross-modal retrieval, and general AI capable of interpreting rich, multi-sensory data. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    ...At inference, the stream from the user is taken from the audio input, and the one for Moshi is sampled from the model's output. Along these two audio streams, Moshi predicts text tokens corresponding to its own speech, its inner monologue, which greatly improves the quality of its generation. A small Depth Transformer models inter codebook dependencies for a given time step, while a large, 7B parameter Temporal Transformer models the temporal dependencies.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    MediaDevices

    MediaDevices

    Go implementation of the MediaDevices API

    mediadevices is a Go library developed by the Pion WebRTC team that enables real-time access to audio and video devices for building native Go applications involving media streaming and conferencing. It provides a cross-platform, unified API for capturing and manipulating media streams and is often used in combination with Pion WebRTC for peer-to-peer communications. Its support for device enumeration, media constraints, and frame processing makes it a powerful building block for custom voice and video solutions in Go.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Vidi2

    Vidi2

    Large Multimodal Models for Video Understanding and Editing

    Vidi is a family of large multimodal models developed for deep video understanding and editing tasks, integrating vision, audio, and language to allow sophisticated querying and manipulation of video content. It’s designed to process long-form, real-world videos and answer complex queries such as “when in this clip does X happen?” or “where in the frame is object Y during that moment?” — offering temporal retrieval, spatio-temporal grounding (i.e. locating objects over time + space), and even video question answering. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Amazon Chime SDK React Components

    Amazon Chime SDK React Components

    Chime React Component Library with integrations with the Amazon SDK

    The Amazon Chime SDK makes it easy to add collaborative audio calling, video calling, and screen share features to web applications by using the same infrastructure services that power millions of Amazon Chime online meetings. The Amazon Chime SDK React Component Library supplies client-side state management and reusable UI components for common web interfaces used in audio and video conferencing applications, including: video tile grids, microphone activity indicators, and call controls. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Peer Calls

    Peer Calls

    Group peer to peer video calls for everyone written in Go

    Peer Calls is a self-hosted, open-source WebRTC-based video and audio calling platform for group communication. Designed for simplicity and privacy, it allows anyone to run their own video conferencing service without relying on third-party providers. Peer Calls supports multi-user rooms, screen sharing, and chat, all delivered via a clean web interface. It’s great for small teams, communities, and educational groups seeking secure and customizable alternatives to mainstream conferencing tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    HunyuanVideo

    HunyuanVideo

    HunyuanVideo: A Systematic Framework For Large Video Generation Model

    HunyuanVideo is a cutting-edge framework designed for large-scale video generation, leveraging advanced AI techniques to synthesize videos from various inputs. It is implemented in PyTorch, providing pre-trained model weights and inference code for efficient deployment. The framework aims to push the boundaries of video generation quality, incorporating multiple innovative approaches to improve the realism and coherence of the generated content. Release of FP8 model weights to reduce GPU...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 23
    Laravel FFMpeg

    Laravel FFMpeg

    This package provides an integration with FFmpeg for Laravel

    ...You can easily add a watermark using the addWatermark method. With the WatermarkFactory, you can open your watermark file from a specific disk, just like opening an audio or video file. When you discard the fromDisk method, it uses the default disk specified in the filesystems.php configuration file.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 24
    Internet Friendly Media Encoder

    Internet Friendly Media Encoder

    Simple, Flexible & Powerful H.265/HEVC & H266/VVC video encoder!

    A versatile, future-proof, and expandable multimedia encoder designed for ease of use. Internet Friendly Media Encoder supports adding subtitles and attachments, and can function as a remuxing tool with a Copy Stream option. It allows you to combine multiple video, audio, subtitle, and attachment streams into a single file, remove unwanted streams, or incorporate subtitles from another video without extracting. The software also supports AviSynth for advanced video processing. The video and audio encoders are implemented in a plug-in style, enabling users to add their own compiled, optimized CPU architectures for increased speed. ...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 25
    FFmpeg Batch AV Converter

    FFmpeg Batch AV Converter

    Free all in one audio/video ffmpeg batch encoder

    FFmpeg Batch AV Converter is a free universal audio and video encoder for Windows and Linux (via Wine), that allows to use the full potential of ffmpeg command line with a few mouse clicks in a convenient GUI with drag and drop, progress information. Some fancy wizards make things easy for non-experts. Thanks to its multi-file encoding feature, it may be the fastest a/v batch encoder available, since it maximizes system resources usage by launching as many simultaneous processes up to user cpu thread count. ...
    Leader badge">
    Downloads: 1,625 This Week
    Last Update:
    See Project