[go: up one dir, main page]

Showing 17 open source projects for "tf idf java"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Create engaging surveys on your tablet or computer with ease. Icon
    Create engaging surveys on your tablet or computer with ease.

    Choose any of our carefully designed themes, or easily customize colors, fonts, and more to reflect your brand's true look and feel.

    Create great-looking surveys, forms, polls, voting, questionnaires, NPS, customer satisfaction, customer experience, employee satisfaction surveys... on your computer or tablet, customize the look of your survey however you like, & display collected data with eye-catching and insightful graphics.
    Learn More
  • 1
    BERTopic

    BERTopic

    Leveraging BERT and c-TF-IDF to create easily interpretable topics

    ...Instead, we can visualize the topics that were generated in a way very similar to LDAvis. By default, the main steps for topic modeling with BERTopic are sentence-transformers, UMAP, HDBSCAN, and c-TF-IDF run in sequence.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    tidytext

    tidytext

    Text mining using tidy tools

    tidytext brings tidy data principles to text mining by converting text into a tidy data frame format. It provides tools for tokenization, sentiment analysis, n‑gram creation, and term‑document matrices, enabling interoperability with dplyr, ggplot2, and other tidyverse workflows.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    bleve

    bleve

    A modern text indexing library for go

    ...By indexing your data with bleve you gain the ability to compose query types such as Term, Phrase, Match, Match Phrase, Prefix, Conjunction, Disjunction, Boolean, Numeric and Date Ranges, as well as Query String. Industry standard tf-idf scoring with query time boosting. Includes support for highlighting matching text within document fragments.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    natural

    natural

    General natural language facilities for node

    "Natural" is a general natural language facility for nodejs. It offers a broad range of functionalities for natural language processing. Tokenizing, stemming, classification, phonetics, tf-idf, WordNet, string similarity, and some inflections are currently supported. It’s still in the early stages, so we’re very interested in bug reports, contributions and the like. Note that many algorithms from Rob Ellis’s node-nltools are being merged into this project and will be maintained from here onward. While most of the algorithms are English-specific, contributors have implemented support for other languages. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Runn is a modern resource and capacity planning platform that gets remote teams on the same page. Icon
    Runn is a modern resource and capacity planning platform that gets remote teams on the same page.

    Runn is best suited for project managers, operations leads, resourcing managers and other people responsible for project delivery.

    Runn has a modern and easy-to-use interface that provides your team with a shared view of all the people and projects in your organization. Plan new work alongside existing projects and instantly see how changes to your plans and resourcing affect your company’s bottom line. Runn is intuitive to use and lets you quickly schedule work using simple drag and drop functionality. Runn also allows you to collaborate with your co-workers in real-time, seeing updates live without having to refresh your browser. Runn combines resource and capacity planning with integrated actual tracking and powerful forecasting to deliver meaningful insights and a full picture of your organization.
    Sign Up - 100% free until July!
  • 5
    TextGen

    TextGen

    textgen, Text Generation models

    ...EDA, simple data augmentation technique: similar words, synonym replacement, random word insertion, deletion, replacement. This project refers to Google's UDA (non-core word replacement) algorithm and EDA algorithm, based on TF-IDF to replace some unimportant words in sentences with synonyms, random word insertion, deletion, replacement, etc. method, generating new text and implementing text augmentation This project realizes the back translation function based on Baidu translation API, first translate Chinese sentences into English, and then translate English into new Chinese. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Texthero

    Texthero

    Text preprocessing, representation and visualization from zero to hero

    Texthero is a python package to work with text data efficiently. It empowers NLP developers with a tool to quickly understand any text-based dataset and it provides a solid pipeline to clean and represent text data, from zero to hero.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Finding topic sentences using TF-IDF. Download and start self container server, then issue a POST request with a single parameter called documents that is the paragraph content to find the three top topic sentences. The results are returned in JSON array. The settings.json file is required for the standalone server to function correctly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    CakeChat

    CakeChat

    CakeChat: Emotional Generative Dialog System

    CakeChat is a backend for chatbots that are able to express emotions via conversations. The code is flexible and allows to condition model's responses by an arbitrary categorical variable. For example, you can train your own persona-based neural conversational model or create an emotional chatting machine. Hierarchical Recurrent Encoder-Decoder (HRED) architecture for handling deep dialog context. Multilayer RNN with GRU cells. The first layer of the utterance-level encoder is always...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    TF-IDF.jar is a Java Archive file to measure TF-IDF of each document in a document collection (corpus). The jar can be used to (a) get all the terms in the corpus (b) get the document frequency (DF) and inverse document frequency (IDF) of all the terms in the corpus (c) get the TF-IDF of each document in the corpus (d) get each term with their frequency (no. of presence), term frequency (TF) and TF-IDF in every document
    Downloads: 0 This Week
    Last Update:
    See Project
  • Transforming NetOps Through No-Code Network Automation - NetBrain Icon
    Transforming NetOps Through No-Code Network Automation - NetBrain

    For anyone searching for a complete no-code automation platform for hybrid network observability and AIOps

    NetBrain, founded in 2004, provides a powerful no-code automation platform for hybrid network observability, allowing organizations to enhance their operational efficiency through automated workflows. The platform applies automation across three key workflows: troubleshooting, change management, and assessment.
    Learn More
  • 10

    TextualModelGenerator

    Generator for textual models by applying different techniques

    This is a project created and supported by: Angel Castellanos Juan Cigarrán Recuero Ana García Serrano This projects allows the modelling of textual contents by applying different techniques: TF-IDF KLD Mutual Information Chi^2 With this application the users can be able to extract the most representative terminology of a textual collection. The application is Java-based, allowing their execution in several platforms and operative systems (Windows, Linux, MacOS).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Texalyzer

    Text analyzer

    Analyzes text document using TF-IDF and optionally stopword list, and extracts important keywords.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    GUIFugue (eventually to become a part of JNIFugue) is a Java-based GUI intended to complement usage of the MUD client, TinyFugue. Using scripting in TinyFugue, users will be able to customize a GUI down to its most basic behaviors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    jIDF Editor
    It is a java based .idf file editor for any OS supporting java. Provides simple & efficient way to create/edit .idf (Input Data Files) for EnergyPlus. With many improved features from traditional editor, it's portable on any OS (MAC/Linux/Win...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A platform independent IDF editor which can be used across various platforms
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    NLP4J library is a toolset written in Java for Natural Language Processing. This version is oriented to Document Classification and uses Naive Bayes, TF-IDF, etc. There are also pre-processing tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    IDEAL means Information DEALer. A System wich provides the news and articles which the user wants. Using Tomcat, Struts, Java, MySQL an AgentSystem, Clustering, TF/IDF, Document Parser and it is multi user able.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Tired of having to "jar -tf" through a bunch of jar files to find the one that has the class you're missing (to kill the dreaded "NoClassDefFoundError")? JarBreaker is for you. Requires java => 1.2.2 and a jdbc compliant db.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next