Showing 315 open source projects for "big data"

  • 1
    IPyPlot

    Fast and efficient plotting of images inside Python Notebooks

    IPyPlot is a small Python package offering fast and efficient plotting of images inside Python Notebooks. It uses IPython with HTML for a faster, richer, and more interactive way of displaying large numbers of images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    zpdf

    Zero-copy PDF text extraction library written in Zig

    zpdf is a high-performance PDF text extraction library written in Zig that focuses on speed, low overhead, and modern parsing techniques. It leans heavily on memory-mapped file reading and zero-copy patterns where possible, so it can scan large PDFs without repeatedly copying data around in memory. The library supports streaming extraction using efficient arena allocation, making it well suited for workloads that need to process big documents quickly or in batches. It implements multiple PDF decompression filters and handles common font encoding pathways, which are essential for turning raw PDF content streams into readable text. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Angel

    A Flexible and Powerful Parameter Server for large-scale ML

    Angel is a high-performance distributed machine learning and graph computing platform based on the philosophy of Parameter Server. It is tuned for performance with big data from Tencent and has a wide range of applicability and stability, demonstrating an increasing advantage in handling higher-dimension models. Angel is jointly developed by Tencent and Peking University, taking account of both high availability in industry and innovation in academia. With a model-centered core design concept, Angel partitions the parameters of complex models into multiple parameter-server nodes and implements a variety of machine learning algorithms and graph algorithms using efficient model-updating interfaces and functions, as well as a flexible consistency model for synchronization. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    huihut interview

    A summary of C/C++ technical interview basics

    ...It’s organized to be approachable whether you’re a student preparing for your first internship or an experienced engineer brushing up on fundamentals before a big interview round.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 5
    Counter

    Web Analytics made simple

    ...The backend is designed to be resource-light while still providing the essential metrics teams care about, such as page views, referrers, and basic geography. Rather than storing everything in a monolithic database, it combines a fast in-memory store for hot data with archival to a traditional SQL store for long-term retention. The client snippet is tiny and cookieless, which helps reduce layout shift and keeps page performance high. Because it’s a small Go service with a straightforward architecture, it can be self-hosted easily and integrated into existing stacks. The philosophy is to deliver “just enough” analytics for dashboards and reporting without the overhead or tracking footprint of big-box platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Nano Events

    Simple and tiny (107 bytes) event emitter library for JavaScript

    Nano Events is a minimalistic, high-performance event emitter library for JavaScript. Its goal is to provide the simplest possible API to add pub/sub capabilities (emitters and listeners) to any JS object or application, while keeping overhead and bundle size extremely small. Rather than offering many complex features, nanoevents focuses on the core primitives: creating an emitter, subscribing to named events, emitting events with arbitrary data, and unsubscribing. Because of its minimal API...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    MOA - Massive Online Analysis

    Big Data Stream Analytics Framework.

    A framework for learning from a continuous supply of examples, a data stream. Includes classification, regression, clustering, outlier detection and recommender systems. MOA is related to the WEKA project and is also written in Java, while scaling to adaptive large-scale machine learning.
    Downloads: 29 This Week
    Last Update:
    See Project
  • 8
    gravitino

    Unified metadata lake for data & AI assets.

    Apache Gravitino is a high-performance, geo-distributed, and federated metadata lake. It manages metadata directly in different sources, types, and regions, providing users with unified metadata access for data and AI assets.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 9
    DataSophon

    The next generation of cloud-native big data management expert

    DataSophon aims to quickly deploy, manage, monitor, and automate the operation and maintenance of Big Data service components and nodes, helping you quickly build stable, efficient Big Data cluster services. The Three-Body Problem, winner of the Hugo Award, one of science fiction's highest honors, is known for its stunning "hard science fiction" style, and its author Liu Cixin is credited with "single-handedly raising Chinese science fiction to a world-class level". ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 10
    HPCC Systems

    End-to-end big data in a massively scalable supercomputing platform.

    HPCC Systems® (www.hpccsystems.com) from LexisNexis® Risk Solutions is a proven, open source solution for Big Data insights that can be implemented by businesses of all sizes. With HPCC Systems, developers can design applications with Big Data at their core, enabling businesses to better analyze and understand data at scale, improving business time to results and decisions. HPCC Systems offers a consistent data-centric programming language, two processing platforms and a single, complete end-to-end architecture for efficient processing. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Parkiet

    Parquet format file GUI editor

    Parquet file viewer and editor written in Java and SWT. It uses the Apache Avro library for reading and writing edited Parquet files. Only Parquet files with simple data type columns are supported.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 12
    qvge

    Qt Visual Graph Editor

    ...Its main goal is to make it possible to visually edit two-dimensional graphs in a simple and intuitive way. Please note that qvge is not a replacement for software such as Gephi, Graphviz, Dot, yEd, Dia and so on. It is neither a tool for "big data analysis" nor a math application. It is really just a simple graph editor :)
    Downloads: 15 This Week
    Last Update:
    See Project
  • 13
    PANDA

    A comprehensive and flexible quantification tool for proteomics data

    ...On the levels of spectra, peptides and proteins, PANDA works out a few quantitative filters and new scores for quantification confidence. Third, PANDA is designed for processing proteomics big data in parallel.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    json-scada

    A portable SCADA/IoT platform centered on the MongoDB database server.

    ...MongoDB as the real-time core database, persistence layer, config store, SOE historian. Portability and interoperability over Linux, Windows, x86/64, ARM. Horizontal scalability, from a single computer to big clusters (MongoDB-sharding), Bare Metal, Docker containers, VM, cloud, or hybrid deployments. Unlimited tags, servers, and users. HTML5 Web interface. UTF-8/I18N. Protocols: IEC61850 Client, IEC60870-5-101/104 Client and Server, DNP3 Client, OPC-UA Client/Server, MQTT/Sparkplug-B, Telegraf (various data sources for monitoring like Modbus, SNMP, etc.) ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    Pentaho

    Pentaho offers a comprehensive data integration and analytics platform.

    Pentaho couples data integration with business analytics in a modern platform to easily access, visualize and explore data that impacts business results. Use it as a full suite or as individual components that are accessible on-premise, in the cloud, or on-the-go (mobile). Pentaho enables IT and developers to access and integrate data from any source and deliver it to your applications all from within an intuitive and easy to use graphical tool. The Pentaho Enterprise Edition Free Trial...
    Downloads: 1,261 This Week
    Last Update:
    See Project
  • 16
    Old File Delete

    Clean up old files with a single click.

    OldFileDelete (OFD) is a lightweight and efficient utility designed for those who value minimalism and order. The app helps you instantly clear selected folders of accumulated digital clutter. Featuring a modern flat design, the interface is intuitive: simply select a folder, specify the number of days, and the program will find and remove outdated files. No complex settings—just cleanliness and speed.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Make sure to download from the link below and not the big giant button. I'm not sure how to fix that, so if you know!
    Downloads: 17 This Week
    Last Update:
    See Project
  • 18
    Maxima -- GPL CAS based on DOE-MACSYMA

    Computer Algebra System written in Common Lisp

    Maxima is a computer algebra system comparable to commercial systems like Mathematica and Maple. It emphasizes symbolic mathematical computation: algebra, trigonometry, calculus, and much more. For example, Maxima solves x^2-r*x-s^2-r*s=0, giving the symbolic results [x=r+s, x=-s]. It can also calculate with exact integers and fractions, native floating-point, and high-precision big floats. Maxima has user-friendly front-ends, an online manual, plotting commands, and numerical...
    Downloads: 3,628 This Week
    Last Update:
    See Project
  • 19

    Faum

    Fast Autonomous Unsupervised Multidimensional Classification

    This is the proof-of-concept implementation of the FAUM Clustering method. This implementation was used to produce the published results and is now released in the hope that it will be useful.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    GDS3D

    Interactive 3D Layout Viewer for GDSII

    GDS3D is a cross-platform, hardware-accelerated 3D viewer for chip layouts. Read standalone GDS files or use the Cadence plugin for easy integration with your Virtuoso environment. Developed by PhDs of the IC-Design Group, University of Twente, The Netherlands. The GDS3D project is also hosted on GitHub: https://github.com/icdut24/GDS3D, but is currently not actively maintained by us. Forks can be found at https://github.com/trilomix/GDS3D and https://github.com/skuep/GDS3D
    Downloads: 11 This Week
    Last Update:
    See Project
  • 21
    To give users full control over the running application. This means that an application still works according to its purpose, but control over the whole interface is taken from the developer and given to users. While an application is running, users can move, resize, and tune all the screen objects through which they communicate with the application. The set of files includes the book (in both DOC and PDF formats), a big demonstration project with all its files available (all the...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 22
    Settimino

    Arduino Ethernet library for communicating with S7 Siemens PLC

    Small but efficient communication library that lets your sketches interface with Siemens PLCs. Works starting from the Arduino UNO with the W5100 Ethernet shield.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 23
    StartOS

    Linux server OS optimized for self-hosting

    StartOS is a sovereign, self-hosted operating system built by Start9 Labs to empower individuals with digital independence. Designed to run on personal servers, it provides a privacy-first interface for installing, managing, and running decentralized applications without needing technical expertise. StartOS includes services like Bitcoin nodes, messaging platforms, file hosting, and password managers, all running locally and without third-party control. With a user-friendly UI and strong...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 24
    elevmaps

    3D map viewer for your hiking trips.

    ...Map view with zoom in/out commands and finger-gesture integration. Import and view OziExplorer maps. Download and attach elevations for the currently downloaded maps. Use elevation data in SRTM3 format (stored on the device and well suited to big maps). Track recording in KML/GPX format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Decaleon

    Multilingual Esperanto Translator, Word Dictionary, Vocabulary Trainer

    Esperanto Translator - translates sentences from 12 languages to Esperanto - VB.NET project based on XML - 30-60,000 words in each language: extensible vocabulary for English, French, Spanish, Portuguese, Italian, German, Danish, Swedish, Greek, Russian, Polish and Croatian, including context and synonym data as well as semantic group information. It includes an any-to-any multilingual Dictionary and Vocabulary Trainer (learning levels/logic themes/semantic groups) and uses UTF8/Unicode characters; export into Text, HTML, TeX, PDF files; text files may be imported into other vocabulary training software. Version 6.0 adds a large number of standard words and supports another 23 languages with a small vocabulary for: Albanian, Bulgarian, Czech, Dutch, Finnish, Hungarian, Norwegian, Romanian, Serbian, Slovak, Slovene, Turkish, Ukrainian, Interslavic, Arabic and Asian languages. ...
    Downloads: 1 This Week
    Last Update:
    See Project
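
The pub/sub primitives named in the Nano Events entry (creating an emitter, subscribing to named events, emitting events with arbitrary data, and unsubscribing) can be illustrated with a minimal sketch in Python. The `Emitter` class and its `on` and `emit` methods are hypothetical names chosen for this sketch; they are an assumption for illustration and are not claimed to match the library's JavaScript API or implementation.

```python
from collections import defaultdict
from typing import Any, Callable, DefaultDict, List

Listener = Callable[..., Any]


class Emitter:
    """Hypothetical minimal event emitter (illustration only)."""

    def __init__(self) -> None:
        # Maps an event name to the listeners subscribed to it.
        self._listeners: DefaultDict[str, List[Listener]] = defaultdict(list)

    def on(self, event: str, listener: Listener) -> Callable[[], None]:
        """Subscribe to a named event; return a function that unsubscribes."""
        self._listeners[event].append(listener)

        def unsubscribe() -> None:
            if listener in self._listeners[event]:
                self._listeners[event].remove(listener)

        return unsubscribe

    def emit(self, event: str, *args: Any) -> None:
        """Call every listener registered for the event with the given data."""
        # Iterate over a copy so a listener may unsubscribe while being called.
        for listener in list(self._listeners[event]):
            listener(*args)


received = []
emitter = Emitter()
unsubscribe = emitter.on("tick", received.append)
emitter.emit("tick", 1)  # delivered to the subscribed listener
unsubscribe()
emitter.emit("tick", 2)  # no listeners remain, so nothing is delivered
```

Returning the unsubscribe function from `on` keeps the surface to two methods, which is one way to get the small API the entry describes.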