MiraOmics

Data Infrastructure and Analytics

Berkeley, CA 272 followers

Truly Understand Your Data

View all 3 employees

About us

Bioinformatics Professional Services for pharma and biotech. We help you truly understand your data! Home of MiraTyper - cell type everything

Website: https://miraomics.bio/
External link for MiraOmics
Industry: Data Infrastructure and Analytics
Company size: 2-10 employees
Headquarters: Berkeley, CA
Type: Privately Held
Founded: 2024

Locations

Primary

Berkeley, CA 94708, US

Get directions

Employees at MiraOmics

See all employees

Updates

MiraOmics

272 followers
3mo
Report this post
Get ready to watch and listen to Eugene Bolotin, Ph.D. discuss the MiraTyper from MiraOmics and how to simplify your cell typing!

Pythia BioSciences

3,255 followers
3mo

📌 What will be discussed in our upcoming webinar "From Single Cells to AI-Driven Target Selection and Translational Workflows"? Focusing on single-cell data, AI, and their integration, this session provides an opportunity to explore not only the growing trends of AI in single-cell analysis, but also - in greater depth - how target discovery and translational workflows are evolving under its influence. Check out the topics we’ll be discussing below, and don’t forget to register to join us live! Date and time: 11AM EST | 4PM GMT Wednesday November 12, 2025 Link for registration 👇

Like Comment Share
MiraOmics reposted this
Eugene Bolotin, Ph.D.
4mo Edited
Report this post
🚀 Exciting Results: MiraTyper from Miraomics Matches or Exceeds Leading Cell Typing Embedding Models We’re thrilled to share that we have benchmarked our MiraTyper embeddings against the Chan Zuckerberg Cell Typing Leaderboard — and the results speak for themselves. Using both the CZ benchmark dataset and our internal evaluation framework, MiraTyper matches or exceeds the performance of the top public embeddings on the most relevant part of the benchmark (logistic regression). But performance is only part of the story. The CZ benchmarks test embeddings only — the mathematical “engine” that represents cell properties. However, in practice, you can’t drive an engine alone. To classify cell types, you need both the engine (embedding) and the vehicle (classifier) that interprets it. MiraTyper combines both. Our embedding is state-of-the-art — but our integrated classifier, trained on over 3.5 million cells, takes it further by delivering ready-to-use, high-accuracy cell type predictions right out of the box. No need to build the rest of the car! That means: ⚙️ High performance: Benchmarked, state-of-the-art embedding performance 🧠 Complete package: A powerful built-in classifier for actual annotation 🏢 Private and easily deployable: Immediate on-premise deployment — no GPU, no complex setup 👩🔬 Ready to go: No need for a dedicated bioinformatician to convert embeddings into labels 💡 Customizable: Tailor models to specific client datasets and use cases If the embedding is the engine, MiraTyper is the car — built, tuned, and ready to drive. —------------------------------------------------------------------------------ As a reminder: ☕️ For less than the price of a coffee, you can now try MiraTyper on your own data on LatchBio and examine it in detail on Pythia BioSciences platform. Reach out to us to find out how. Links to the benchmarking tools by CZ and how to try MiraTyper are in the comments:
2 Comments

Like Comment Share
MiraOmics reposted this
Eugene Bolotin, Ph.D.
4mo Edited
Report this post
🎉 Exciting News: MiraTyper from Miraomics is Now Publicly Available on LatchBio! After months of development, we're thrilled to announce that MiraTyper is now accessible to the research community for automated cell type annotation at unprecedented resolution. What is MiraTyper? Amazing cell type prediction for your single cell RNA-seq data! Just upload your scRNA-seq, H5AD file and get highly detailed cell type annotations in under 10 minutes. Getting Started is Easy: 1️⃣ Create a free LatchBio account and add payment info 2️⃣ Find MiraTyper in public workflows 3️⃣ Upload your h5ad file 4️⃣ Get results in <10 minutes Why MiraTyper? ✅ >80% hierarchical precision and recall on 10x genomics generated human data. 🔬Batch resistant 🔬Tissue agnostic 🔬Works on spatial data (Visium, Xenium, CosMx) 🔬Can work across organisms (through homologous gene mapping). Demo Version Details: 🔬 Files up to 100k cells 🔬 Data randomly subsampled to 50% for demo (e.g., 10,000 → 5,000 cells) Full Version Details: 🔬No demo limitations 🔬Local install available through docker or python package Customization through professional services: 🔬Unusual model organisms. 🔬Rare or unusual cell types. 🔬Custom disease tuning (cancer, other) We can adapt the model for your needs! We Want Your Feedback! We're actively collecting bug reports and user feedback to improve MiraTyper. Don't hesitate to reach out, through linkedin, or send an email at eugene@miraomics.bio, if you have comments, suggestions, custom requirements. #SingleCell #Bioinformatics #ScRNAseq #CellTyping #Genomics #ComputationalBiology #ResearchTools Special thanks for RJ Honicky for driving this project and Pythia Biosciences for providing 10TB of training data! Follow these steps on LatchBio after making the account.
2 Comments

Like Comment Share
MiraOmics reposted this
Eugene Bolotin, Ph.D.
6mo Edited
Report this post
🧬 Exploring the Significance of Benchmarking with Hierarchy in Cell Typing When we built our Cell Typer -MiraTyper, we decided to evaluate it against state of the art methods. However, we quickly run into a problem! Our accuracy was really low (F1, precision, recall) ! However, when we looked closely into our results, we were calling cells at a very fine level and beating the competition. For example: we labeled "Naive CD4 T-cells" vs competition labeled "CD4 T-cell". What happened? ⚠️ The Pitfall of Standard Metrics Conventional evaluation methods tend to treat all classification errors uniformly, overlooking the nuanced biological implications of misclassification scenarios. 🌿 Embracing Hierarchical Benchmarking Hierarchical benchmarking introduces a paradigm shift by leveraging the Cell Ontology to assess the biological proximity of predictions. This approach not only acknowledges near-miss predictions but also rewards their significance in real-world applications, offering a more nuanced evaluation of model performance. 📊 Performance Insights from Miraomics In the analysis of brain and immune datasets, Miraomics showcases its prowess: - Hierarchical Recall: 0.97 (compared to 0.11 by CellTypist) - F1 Score: 0.98 (in contrast to 0.19) - Lightning-fast processing under 1 minute for 30k+ cells - Streamlined operations with no need for manual model selection or label harmonization ⚡ Don't Miss the Biological Context Relying solely on flat accuracy metrics for model evaluation overlooks the critical biological context that hierarchical benchmarking can unveil. Stay attuned to the nuances of cell typing with a comprehensive approach like hierarchical benchmarking.
1 Comment

Like Comment Share
MiraOmics reposted this
Eugene Bolotin, Ph.D.
6mo
Report this post
🚀 Presenting Miraomics Cell Typer - MiraTyper: Revolutionizing Cell Type Annotation 🧬 Miraomics is reshaping the landscape of single-cell analysis with our cutting-edge Miraomics Cell Typer. In a recent benchmarking study against established tools like CellTypist. The outcomes speak volumes: ✅ Achieved a remarkable 10x higher recall in brain datasets ✅ Demonstrated a 2x higher recall in immune datasets ✅ Attained an impressive up to 98.4% hierarchical F1 score ✅ Executes in less than 1 minute for over 30,000 cells 🔍 What makes us stand out? Our platform is powered by a hybrid generative AI model, trained on a vast dataset of over 100 million cells from the CellXGene collection. Key distinguishing features include: - Cluster and batch agnostic design, eliminating the need for manual clustering - Pre-harmonized labels aligned with CL ontology, requiring no post-processing - Adaptable and customizable to any dataset and tissue While others grapple with manual model selection and label normalization, Miraomics ensures swift and consistent results with exceptional accuracy. Whether unraveling immune cell signaling intricacies or mapping the human neocortex, our technology keeps pace with the complexities of biology. 🧪 Curious about our performance on your specific data? We provide tailored benchmarking services for clients to witness the impact firsthand. 📊 Explore the comprehensive performance breakdown, visual representations, and more detailed insights in our latest benchmarking report. 🔗 Connect with us to learn more: info@miraomics.bio or reach out to us on LinkedIn! Additional details: https://lnkd.in/gW9ncGYr #singlecell #celltyping #genomics #biotechinnovation Miraomics RJ Honicky
Like Comment Share
MiraOmics

272 followers
7mo
Report this post
We are incredibly excited to see where we will take AI in the future!
Where Tech Meets Bio

3,652 followers
7mo

In today's Where Tech Meets Bio: 🧬 Chai Discovery launched Chai-2, an all-atom generative AI model achieving 16–20% hit rates in zero-shot antibody design across 52 novel antigens. 🌿 Tamarind Bio’s Deniz Kavi introduced #IntFold, a structure prediction model matching #AlphaFold 3 on #FoldBench with improved antibody-antigen and protein-ligand performance. 💰 Medtech VC funding reached $4.1B in Q1 2025, the strongest since early 2022, driven by Neko Health ($260M) and EverBridge Medical ($139M), per Dr. Luka Nićin of Pace Ventures. 📖 Miraomics, Pythia Biosciences, and LatchBio released a 30M-cell atlas across 150 diseases and 200 tissues with an AI tool for automating molecular data curation. 🔍 OWKIN and The Newcastle Upon Tyne Hospitals NHS Foundation Trust announced a 5-year partnership to apply agentic AI to oncology research. 🤏 XtalPi Inc. and Pfizer expanded their partnership to develop a high-throughput AI- and physics-based platform for small molecule discovery. 🏛️ Sweden’s Medical Products Agency Läkemedelsverket launched REGULUS, a generative AI tool for drug regulatory support. 🏥 Maxime Griot, MD and colleagues deployed an on-prem, GDPR-compliant open-source LLM chatbot within the Epic EHR at a European hospital. 🔬 Columbia University Fertility Center used AI to detect viable sperm within hours in an azoospermic patient. 🔭 PathAI received FDA 510(k) clearance for its digital pathology platform used in primary diagnosis. 🔍 TOBY. secured FDA Breakthrough Device Designation for its AI-powered urine test for early bladder cancer detection. 💰 Portal Biotech raised $35M Series A from NATO Innovation Fund (NIF) and Earlybird Venture Capital to commercialize AI-powered nanopore-based protein sequencing. 💰 XtalPi Inc. led the pre-seed round in Foundry Biosciences, an MIT spinout focused on gen-AI protein design for anti-aging therapies. ❤️ Venstra Medical received $1M from MTPConnect to develop its miniature heart pump for treating cardiogenic shock. 🤝 AbbVie acquired Capstan Therapeutics for up to $2.1B to access its LNP platform for in vivo mRNA-based CAR-T therapies. 💰 Laverock Therapeutics raised €23.3M in seed funding (led by Calculus Capital, Lilly and others) to develop its programmable gene control platform. 💰 Tandem Health raised $50M Series A from Kinnevik to build an AI-native clinical OS for documentation, coding, and decision support in European health systems. 🤝 argenx signed a $1.5B deal with Unnatural Products Inc. to develop AI-designed macrocyclic peptides for undruggable targets. 💰 RainPath AI raised €2.5M from Teampact.ventures, ADVANS Lab, Bpifrance, and others for AI-driven virtual staining of biopsies. ✅ FDA removed REMS restrictions on all CAR-T therapies for blood cancers. 💭 Fred Hutch’s Aaron Ring argues that AI is transforming early-stage drug discovery for small labs but won’t revolutionize Big Pharma. Read more: https://lnkd.in/dZEShpS4
Like Comment Share
MiraOmics

272 followers
8mo
Report this post
We’re thrilled to announce that 30 million single-cells have just been released through our partnership with LatchBio and Pythia Biosciences! This marks a major step forward in miraomics’ mission to help scientists truly understand their data. By making this massive dataset freely and easily accessible, we are: 🔓 Breaking down the silos that fragment biological research 🌍 Placing our clients' data in the broader context of global biological knowledge 🧠 Empowering discovery by connecting isolated experiments to high-quality, richly annotated public datasets At Miraomics, we believe that data should be as useful as it is abundant. This release is just the beginning. We’re building the infrastructure and tools to ensure that every client can surface insights from their own experiments in the context of the world’s data. “We are excited to announce this major release of high quality curated data, representing thousands of hours of curation effort, enabling new opportunities for development of novel AI tools and novel insights in basic science, disease progression and drug discovery,” said Eugene Bolotin, Co-Founder and CEO at @Miraomics. https://lnkd.in/g7wgXDa4

Miraomics, Pythia Biosciences and LatchBio Release a 30 Million Cell Atlas and an Agentic AI Framework for Molecular Data Curation businesswire.com

Like Comment Share
MiraOmics reposted this
Eugene Bolotin, Ph.D.
8mo
Report this post
I greatly enjoyed attending the QBI symposium, "Separating the Signal from Noise: AI in Biology." Many amazing talks were presented from basic science to drug discovery. Big thanks to the conference organizers and presenters. My key takeaway: AI in Biology is still very early, but amazingly promising. Most of the concepts involved "scientist in the loop", with AI serving as a virtual assistant, multiplying the speed the expert could do the work thousands of fold. However, we are still far from completely autonomous science. We are also thinking about accelerating research and we are grateful to the organizers for accepting our poster for presentation: "Towards an Easy Transformer-based Universal Cell-Typer". In it, we describe our latest efforts between: miraomics, RJ Honicky, Tùng Nguyễn from Pythia Biosciences to automate cell typing, also known as cell annotation, using transformer based methods. This is a foundational problem in biology, since identifying cells is often the first step in many biological experiments. Our framework is designed to be extendable across multi-omic datatypes such as: scRNA, proteomics, and image data through transformer enabled co-embedding. We are working on eliminating batch effects across methods and labs, benchmarking against current state-of-the-art. We aim to take the pain out of manual or semi manual cell typing, while keeping the "expert in the loop." If you are interested in learning more and getting the copy of the poster, getting early access to our code or collaborating, please don't hesitate to reach out! https://lnkd.in/gECiEH6h

Separating the Signal from the Noise: AI in Biology qbi.ucsf.edu

3 Comments

Like Comment Share
MiraOmics

272 followers
9mo
Report this post
We're looking forward to doing more great work with Latch!

Kenny Workman
9mo Edited

Money is tight in biotech. As NIH grants and venture funding dry up, we are seeing teams reach for publicly available datasets as a replacement or supplement to internal data generation. These teams are building large and tailored atlases: pooling molecular data from hundreds of studies and thousands of patients in the exact disease and tissue areas of interest to internal programs. We see two dominant curation models: 1/ Outsourcing to specialized solution providers (Excelra, Pythia Biosciences, miraomics). More common across our pharma clients who already have procurement processes (and budget) in place. 2/ Building internal labeling teams. Common in upstarts investing in a computational platform, tool development and tight feedback loops with model training. Public data is a powerful and underutilized resource, but the savings in capex come with an enormous human labor cost. These datasets are highly unstructured and their practical use requires purpose built tooling. We have seen adoption of Latch across both curation models: 1/ We equip solution providers with modern infrastructure and curation tools to deliver faster and higher quality results to clients. We provide a portal for them to distribute and reuse their clean data to our customer base for additional revenue. 2/ We train and onboard internal biotech teams to a suite of curation tools that increase ingestion throughput without sacrificing control. While there is a lot of talk about AI/ML transforming R&D, the curation products we're building are the first instance (I've seen) of LLMs having truly outsized economic impact on research. Some concrete tasks: - Constructing count matrices from unstructured author supplements - Harmonizing metadata against controlled vocabularies using entire papers - Using paper information and general immunology knowledge to create first pass ("best effort") cell type annotations You still need a "human in the loop", but we are seeing per-dataset curation times drop from 4/6 hours to 10mins/1hr. High variance but generally OOM improvements. Engineering drives the efficiency and trends in constituent tooling points (eg. swapping in better models) point to further and rapid gains. We are in the design partnership phase with a handful of companies across both groups. Reach out to work with our team in this early phase and otherwise looking forward to an exciting product release late this summer.

Like Comment Share
MiraOmics

Brand partnership • 272 followers
9mo
Report this post
We’re excited to present our latest progress as part of the collaboration between Pythia Biosciences and miraomics, focused on building cutting-edge methods for cell type annotation and multi-omic data interpretation. This joint effort brings together deep expertise in computational biology and single-cell analytics to unlock richer, more precise insights into complex biological systems.

Pythia BioSciences

3,255 followers
9mo Edited

#GenerativeAI is transforming the way we analyze data — but what happens when it meets the complexity and scale of #MultiOmics? Join us for a new webinar of our #OmicsConnect series to hear from Eugene Rakhmatulin – Chief Technology Adviser at Pythia Biosciences (Former CTO at Cohen Veterans Bioscience), and Dr. RJ Honicky – Senior AI Consultant at miraomics, about this exciting topic. Explore how Large Language Models (LLMs) are reshaping the omics research landscape — offering unique advantages over "traditional" ML, particularly: - How they empower cell typing, MOA hypothesis generation, pathway interpretation, and interactive Q&A with omics data - The open-weights vs. proprietary models landscape — examining methods like RAG and fine-tuning for incorporating specialized knowledge, and sharing insights into common pitfalls 📅 Date and time: 9AM Pacific Time | 12PM Eastern Time | 4PM GMT Thursday May 29, 2025 🔗 Registration: https://lnkd.in/dGYVHWyk Looking forward to seeing you in the webinar!

Like Comment Share

Browse jobs

Account Executive jobs

71,457 open jobs

LinkedIn respects your privacy

MiraOmics

Data Infrastructure and Analytics

Berkeley, CA 272 followers

Truly Understand Your Data

About us

Locations

Employees at MiraOmics

RJ Honicky

Alaina Puleo

Updates

Join now to see what you are missing

Similar pages

Pythia BioSciences

LatchBio

Excelra

MemVerge

Santa Ana Bio, Inc.

Outer Biosciences

BioTuring

Stealth Mode Biotech

Couloir Bio

IBD Leuven

Browse jobs

Account Executive jobs