[go: up one dir, main page]

Showing 28 open source projects for "linguistics"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Evertune | Improve Your Brand's Visibility in AI Search Icon
    Evertune | Improve Your Brand's Visibility in AI Search

    For enterprise marketing teams looking for a platform to understand and influence how AI models like ChatGPT recommend their products or services.

    Evertune is the Generative Engine Optimization (GEO) platform that helps brands improve visibility in AI search across ChatGPT, AI Overview, Gemini, Claude and more.
    Learn More
  • 1
    The goal of this project is to investigate optimal ways to do genre classification for the ten indigenous South African languages. Funded by Dept of Arts and Culture of the SA Government. http://www.trifonius.co.za/projects/genre-classification
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    The Hspell Project

    The Hspell Project

    A free Hebrew spell-checker and morphological analyzer

    hspell - Hebrew spellchecker
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    KH Coder

    KH Coder

    Quantitative Content Analysis or Text Mining

    ************************************************************ THIS PROJECT IS MOVED. See http://khcoder.net/en for the latest & greatest. You can download this tool from the new home. See you there! ************************************************************
    Downloads: 0 This Week
    Last Update:
    See Project
  • Propel Software: Product Value Management Platform for Manufacturers Icon
    Propel Software: Product Value Management Platform for Manufacturers

    For modern product companies that need to connect product and commercial teams successfully

    Propel is a cloud-native Product Value Management platform that unifies PLM, QMS, and PIM in one connected system, giving manufacturers complete visibility and control across the entire product lifecycle. It provides a single source of truth for all product data, streamlines change management, strengthens quality and compliance processes, and accelerates time-to-market by eliminating the silos and manual steps that slow teams down.
    Learn More
  • 5

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are...
    Leader badge">
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    ElixirFM

    ElixirFM

    Functional Arabic Morphology

    ElixirFM is a high-level implementation of Functional Arabic Morphology. The core of ElixirFM is written in Haskell, while interfaces in Python and Perl support lexicon editing and other interactions. http://github.com/otakar-smrz/elixir-fm
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Encode Arabic
    Encode Arabic provides tools for encoding and decoding Arabic in Haskell, Python, Perl, or LaTeX. Interprets the ArabTeX notation to generate original orthography or phonetic transcription. Supports Buckwalter and other romanizations. Converts legacy byte encodings into Unicode. http://github.com/otakar-smrz/encode-arabic
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    This project concerns the development of human language technology resources, based on the approach to share or recycle resources between closely related language. http://gerhard.pro/closely-related-languages/
    Downloads: 0 This Week
    Last Update:
    See Project
  • Loan business software for any kind of loan Icon
    Loan business software for any kind of loan

    For established lenders, loan-granting scaleups and credit startups.

    CreditOnline is an end-to-end integrated loan business management platform helping organizations connect and integrate all relevant parties in a state-of-art lending ecosystem that supports any kind of loan or lending segment. Our future-proof lending technology and partnerships render it simple, fast and cost-effective to expand the portfolio of your services and number of your clients, allowing your business to scale up or reinvigorate itself effectively at any stage of the business life-cycle.
    Learn More
  • 10
    This project is devoted to the development of natural language processing tools and resources for the Lingala language, which is spoken by tens of millions of people in central Africa.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Automatic Compound Processing (AuCoPro)

    Automatic compound splitting and semantic analysis of compounds

    The central problem to be addressed in this project concerns a multidisciplinary (linguistics and computational linguistics) investigation into sharing of knowledge and resources between closely-related languages, specifically relating to the automatic processing of compounds. Specifically, we will explore the possibility to create new knowledge about closely-related languages, and efficiently develop additional, more advanced resources for (a) compound segmentation; and (b) the semantic analysis of compounds; as such, the project will be divided into two interrelated subprojects, to be executed simultaneously. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Perstem
    Perstem is a Persian (Farsi) stemmer, morphological analyzer, transliterator, and partial part-of-speech tagger. Inflexional morphemes are separated or removed from their stems. Perstem can also tokenize and transliterate between various character set encodings and romanizations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ValiTerms

    ValiTerms

    Validation of terms in corpus

    ValiTerms is a tool that helps the validation of terms in corpus. It finds their occurrences and allows terminologists to choose if a term is relevant or not. ValiTerms is developed at LIPN (http://www-lipn.univ-paris13.fr), RCLN team. Please consult the wiki for instructions about installation and usage.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Various tools for creating annotated parallel corpora including pre-trained tagging and parsing models for various languages, sentence alignment tools and word alignment tools. Uplug also includes a web-based interface for interactive sentence and word alignment and scripts for indexing and querying parallel corpora using the Corpus Work Bench CWB. Download 'uplug-main' first and then add other packages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Linguistico is a linguistics tools project based on Italian language. Tools are: dictionaries, thesaurus, words definitions, scripts, programs, ... For: OpenOffice.org LibreOffice ThunderBird Mozilla FireFox - MySpell MyThes Aspell HunSpell
    Downloads: 50 This Week
    Last Update:
    See Project
  • 16
    This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text. The method is based on SVM but other ML algorithms can be adopted. The method details are explained in the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    The Parenthesis Classifier takes the contents of a set of parentheses and classifies it into one of several categories. It includes a parenthesized-data extractor and the classifier.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The Simple Semantic Classifier classifies short chunks of natural language text into broad semantic classes that correspond to the OBO ontologies provided as input.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    A database of linguistic annotation of medical text (from MEDLINE), including corpora used with ABGene, BioCreative I and II, and the MedPost training corpus.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    This program reads each of 270,000 entries of the BÍN database of ICELANDIC lemmata and all their forms. It assigns one of hundreds of morphological paradigms to each entry. It won a special award in the Þú átt orðið competition (www.ordid.is)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Varamozhi is a free English-Malayalam transliteration library. It can transliterate Malayalam text between Malayalam and English scripts. Varamozhi takes as the input, the mapping between a Malayalam font and a transliteration scheme; outputs functions i
    Leader badge">
    Downloads: 77 This Week
    Last Update:
    See Project
  • 22
    Sample turing machine for educational purposes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    This project tries to make Spanish Wikipedia a useful resource for the language research community.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A sound change applier to simulate phonological shift, as from Latin to Spanish.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Based on the Buckwalter Morphological Analyzer (Version 1.0) for doing Arabic stemming and POS tagging. Includes a rewrite of the original Perl script, with better documentation and more flexible options, and a C++ interface (usable as a library or app).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next