927 projects for "visual" with 1 filter applied:

  • 1
    Visual Blocks

    Visual Blocks for ML is a Google visual programming framework

    ...Because everything lives in the browser, sharing is as simple as exporting a project or link, and collaborators can experiment without installing toolchains. For educators and product teams alike, Visual Blocks reduces the distance from idea to interactive proof-of-concept by turning ML diagrams.
    Downloads: 1 This Week
  • 2
    Obsidian Visual Skills Pack

    Generate Canvas, Excalidraw, and Mermaid diagrams from text

    LLM-TLDR is a Python-based tool designed to dramatically reduce the amount of code a large language model needs to read by extracting the essential structure and context from a codebase and presenting only the most relevant parts to the model. Traditional approaches often dump entire files into a model’s context, which quickly exceeds token limits; LLM-TLDR instead indexes project structure, traces dependencies, and summarizes code in a way that preserves semantic relevance while shrinking...
    Downloads: 2 This Week
  • 3
    1D Visual Tokenization and Generation

    This repo contains the code for 1D tokenizer and generator

    The 1D Visual Tokenization and Generation project from ByteDance introduces a novel “one-dimensional” tokenizer designed for images: instead of representing images with large grids of 2D tokens (as in many prior generative/image-modeling systems), it compresses images into as few as 32 discrete tokens (or more, optionally) — thereby achieving a very compact, efficient representation that drastically speeds up generation and reconstruction while retaining strong fidelity.
    Downloads: 0 This Week
  • 4
    DeepSeek-OCR 2

    Visual Causal Flow

    DeepSeek-OCR-2 is the second-generation optical character recognition system developed to improve document understanding by introducing a “visual causal flow” mechanism, enabling the encoder to reorder visual tokens in a way that better reflects semantic structure rather than strict raster scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning capabilities that mimic human visual scanning behavior, enhancing OCR performance on documents with rich spatial structure. ...
    Downloads: 19 This Week
  • 5
    Phaser Editor 2D

    A web-based IDE for HTML5 game development. Powered by Phaser

    Phaser Editor 2D v3 is a lightweight, multi-platform IDE for HTML5 game development using the Phaser framework. It combines visual tools and code editing capabilities, helping developers design scenes, UI, and game logic in an efficient and organized workflow.
    Downloads: 31 This Week
  • 6
    C.A.F.E.

    The "Third Way" for Home Assistant Automations

    CAFE (Complex Automation Flow Editor) for Home Assistant is an innovative automation authoring tool that provides a visual flow editor inside Home Assistant, offering a “third way” between manual YAML and external flow tools by transpiling visual flows directly into native Home Assistant automation YAML. It’s designed to give users the visual clarity of tools like Node-RED without the overhead of a separate execution engine, storing the logic as standard YAML that runs in the core automation engine with zero added resource use. ...
    Downloads: 2 This Week
  • 7
    Blockly Games

    Games for tomorrow's programmers

    ...Progression is scaffolded: earlier levels constrain the toolset, while later levels unlock more expressive blocks and optional textual views. The project’s emphasis on immediate visual feedback and forgiving iteration lowers the barrier for first-time programmers while still rewarding mastery.
    Downloads: 16 This Week
  • 8
    RenJS

    RenJS Typescript Version

    RenJS V2 is a TypeScript-based visual novel engine built on Phaser, designed to help creators develop interactive storytelling experiences. It offers a script-like syntax for defining scenes, dialogues, and character interactions.
    Downloads: 7 This Week
  • 9
    SAM 3

    Code for running inference and finetuning with SAM 3 model

    SAM 3 (Segment Anything Model 3) is a unified foundation model for promptable segmentation in both images and videos, capable of detecting, segmenting, and tracking objects. It accepts both text prompts (open-vocabulary concepts like “red car” or “goalkeeper in white”) and visual prompts (points, boxes, masks) and returns high-quality masks, boxes, and scores for the requested concepts. Compared with SAM 2, SAM 3 introduces the ability to exhaustively segment all instances of an open-vocabulary concept specified by a short phrase or exemplars, scaling to a vastly larger set of categories than traditional closed-set models. ...
    Downloads: 69 This Week
  • 10
    Unredact

    A simple tool for reading in poorly redacted documents

    ...Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and linguistic patterns to produce candidate reconstructions. It accepts a variety of input formats, automatically identifies redacted regions, and then generates text suggestions that are presented alongside visual overlays so users can choose or refine outputs.
    Downloads: 39 This Week
  • 11
    CC Workflow Studio

    Accelerate Claude Code/GitHub Copilot

    CC Workflow Studio is a powerful Visual Studio Code extension that accelerates AI-assisted development by providing a visual workflow editor tailored for AI automation and agent orchestration, particularly with tools like Claude Code, GitHub Copilot, OpenAI Codex, and others. The extension lets developers and creators design complex AI workflows using intuitive drag-and-drop canvases or via conversational AI commands, blending graphical editing with natural language refinement. ...
    Downloads: 3 This Week
  • 12
    Inkeep

    Create AI Agents in a No-Code Visual Builder or TypeScript SDK

    Inkeep is an open-source framework for building and deploying AI agent workflows and interactive assistants that operate autonomously across applications, enterprise environments, and customer engagement use cases. It lets developers and non-technical users create, manage, and orchestrate multi-agent systems using both a no-code visual builder and a full TypeScript SDK, giving two ways to define agent behaviors that stay in sync with each other. Agents built with this framework can act as real-time conversational assistants — for example, handling help desk inquiries, providing internal support to teams, or driving in-app experiences — and they can be extended to automate multi-step tasks that interact with external systems like CRMs, knowledge bases, or ticketing systems. ...
    Downloads: 3 This Week
  • 13
    ClaudeBar

    A macOS menu bar application that monitors AI coding assistant usage

    ...Rather than constantly running CLI commands or navigating web dashboards, users can glance at their quota statistics for services like Claude, Codex, Gemini, GitHub Copilot, and Antigravity directly from the menu bar. The application provides real-time tracking of session, weekly, and model-specific usage percentages, using visual indicators such as color-coded progress bars to communicate when quotas are healthy, nearing limits, or depleted. It includes options to enable or disable monitoring for individual providers, supports multiple visual themes (including dark mode and a festive theme), and refreshes data at configurable intervals so users always have up-to-date information.
    Downloads: 4 This Week
  • 14
    Moondream

    Tiny vision language model

    ...It serves as both a playground for the author’s artistic curiosity and a resource for other creative coders interested in generative art techniques. The repository may include shaders, canvas/WebGL code, visual demos, and utilities that demonstrate how mathematical functions or noise patterns can be harnessed for compelling visuals.
    Downloads: 1 This Week
  • 15
    S&box

    s&box is a modern game engine, built on Valve's Source 2

    ...Built on a cutting-edge game engine, s&box allows creators to prototype, build, and share interactive game modes, tools, and environments using C#, JavaScript, and visual scripting, promoting accessible content creation for developers of varying skill levels. The platform emphasizes multiplayer and community experiences, giving creators direct control over networking, physics, rendering, and audio without needing to build those systems from scratch. With real-time recompilation and fast iteration loops, developers can see changes instantly, speeding up the creative process dramatically compared to traditional engines. ...
    Downloads: 30 This Week
  • 16
    ComfyUI-LTXVideo

    LTX-Video Support for ComfyUI

    ComfyUI-LTXVideo is a bridge between ComfyUI’s node-based generative workflow environment and the LTX-Video multimedia processing framework, enabling creators to orchestrate complex video tasks within a visual graph paradigm. Instead of writing code to apply effects, transitions, edits, and data flows, users can assemble nodes that represent video inputs, transformations, and outputs, letting them prototype and automate video production pipelines visually. This integration empowers non-programmers and rapid-iteration teams to harness the performance of LTX-Video while maintaining the clarity and flexibility of a dataflow graph model. ...
    Downloads: 5 This Week
  • 17
    Effekseer

    This software is a particle effect editing tool

    Effekseer is an open-source tool designed for creating stunning visual effects for games and multimedia applications. It allows artists and developers to design complex particle effects through an intuitive graphical user interface, supporting various features like particles, beams, and distortion effects. Effekseer supports multiple platforms and integrates seamlessly with popular game engines, enabling the creation of rich visual experiences without extensive programming knowledge.
    Downloads: 3 This Week
  • 18
    agentation

    The visual feedback tool for agents

    Agentation is a visual annotation and feedback tool designed to make interacting with AI coding agents more intuitive and precise by letting developers visually click on frontend elements in a browser and annotate them with context before sending structured feedback to an agent. Instead of describing UI elements in text — like “the blue button in the sidebar” — users click directly on elements to automatically capture selectors, positions, and contextual metadata that can be consumed by AI agents to locate exact code references. ...
    Downloads: 4 This Week
  • 19
    diagrams.net

    Security-first diagramming for teams

    ...Office 365 app for Word, PowerPoint and Excel. Highest-rated Confluence app in the Atlassian Marketplace. Works in Jira too. Works with GitHub, GitLab and Dropbox for visual documentation in distributed teams. You can download diagrams.net Desktop for offline use on Windows, macOS, Linux and ChromeOS. Nextcloud is an ideal on-premise file hosting system for diagrams.net.
    Downloads: 134 This Week
  • 20
    drawio-desktop

    Security-first diagramming for teams

    ...Office 365 app for Word, PowerPoint and Excel. Highest-rated Confluence app in the Atlassian Marketplace. Works in Jira too. Works with GitHub, GitLab and Dropbox for visual documentation in distributed teams. Download diagrams.net Desktop for offline use on Windows, macOS, Linux and ChromeOS. Nextcloud is an ideal on-premise file hosting system for diagrams.net. Entity relationship diagrams, or ER models, show the structure of database objects and the relationships between them. They are used extensively in database modelling to plan new systems and to document existing systems for maintenance and updates.
    Downloads: 151 This Week
  • 21
    GLM-Image

    GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image

    ...It excels at generating images that include complex layouts and detailed text content, making it especially useful for posters, diagrams, infographics, social media graphics, and visual content that requires precise text placement and semantic alignment. Because it blends linguistic reasoning with image synthesis, GLM-Image produces visual outputs where semantic relationships and textual accuracy are prioritized alongside artistic style and realism, and its model structure enables it to handle dense visual knowledge tasks that challenge many pure diffusion models. ...
    Downloads: 4 This Week
  • 22
    Janus

    Unified Multimodal Understanding and Generation Models

    Janus is a sophisticated open-source project from DeepSeek AI that aims to unify both visual understanding and image generation in a single model architecture. Rather than having separate systems for “look and describe” and “prompt and generate”, Janus uses an autoregressive transformer framework with a decoupled visual encoder—allowing it to ingest images for comprehension and to produce images from text prompts with shared internal representations.
    Downloads: 1 This Week
  • 23
    Dafthunk

    A workflow execution platform built on top of the fantastic Cloudflare

    ...It aims to combine the approachability of a visual editor with the practical needs of real automation: state persistence, execution history, reusable nodes, and integrations with external systems. A key appeal is that you can go from idea to running automation quickly in a hosted-like experience while still keeping the project open source and infrastructure-aware.
    Downloads: 2 This Week
  • 24
    Depth Anything 3

    Recovering the Visual Space from Any Views

    Depth Anything 3 is a research-driven project that brings accurate and dense depth estimation to any input image or video, enabling foundational understanding of 3D structure from 2D visual content. Designed to work across diverse scenes, lighting conditions, and image types, it uses advanced neural networks trained on large, heterogeneous datasets, producing depth maps that reveal scene depth relationships and object surfaces with strong fidelity. The model can be applied to photography, AR/VR content creation, robotics perception, and 3D reconstruction workflows, making it versatile across industries and research domains. ...
    Downloads: 7 This Week
  • 25
    StoryGen Atelier

    AI-assisted storyboard and video generation tool

    StoryGen Atelier is an advanced creative tool that blends AI with visual storytelling, making it possible to generate fully structured storyboards and stitched videos from text prompts without requiring manual art or animation skills. Users begin with natural language descriptions of their story or scene, and the system uses state-of-the-art large models to generate both the script and corresponding frames.
    Downloads: 5 This Week