[go: up one dir, main page]

WO2026006705A1 - Methods and systems for modeling multilayer networks - Google Patents

Methods and systems for modeling multilayer networks

Info

Publication number
WO2026006705A1
WO2026006705A1 PCT/US2025/035658 US2025035658W WO2026006705A1 WO 2026006705 A1 WO2026006705 A1 WO 2026006705A1 US 2025035658 W US2025035658 W US 2025035658W WO 2026006705 A1 WO2026006705 A1 WO 2026006705A1
Authority
WO
WIPO (PCT)
Prior art keywords
multiplex
network
networks
heterogenous
drug
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/US2025/035658
Other languages
French (fr)
Inventor
Michael Levin
Leo PIO-LOPEZ
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tufts University
Original Assignee
Tufts University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tufts University filed Critical Tufts University
Publication of WO2026006705A1 publication Critical patent/WO2026006705A1/en
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/20Ensemble learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/01Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition

Definitions

  • Multilayer networks which incorporate various nodes and connections in formats such as multiplex, heterogeneous, and bipartite networks, provide an effective framework for merging diverse and multi-scale biological data sources.
  • current network embedding methods face challenges and limitations in addressing the heterogeneity and diversity of these networks. Therefore, there is an essential need for the development of new network embedding methods to manage the complexity and diversity of multi-omics biological information effectively.
  • SUMMARY [0003] Disclosed herein are methods, systems, and computer readable media for characterizing a multilayer network.
  • the method includes: obtaining a multilayer network including a plurality of multiplex heterogenous networks and a plurality of bipartite networks connecting the multiplex heterogeneous networks in the plurality of multiplex heterogeneous networks, in which each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks includes a plurality of nodes; applying a trained machine learning algorithm to relate the plurality of multiplex heterogeneous networks to an embedding space, in which the trained machine learning algorithm uses random walk with restart; using the embedding space of the plurality of multiplex heterogeneous networks to extract information corresponding to interactions between the plurality of nodes of each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks; and outputting the embedding space and the extracted information to a user.
  • FIG.1 shows a universal multilayer network consisting of 3 distinct multiplex networks (gene, drug and disease), each represented by different colors (yellow, purple, and blue).
  • Each of these multiplex networks consists of various types of nodes: squares are genes, circles are drugs and triangles are diseases. They are interconnected through three bipartite networks (gene-disease, drug-target and drug-disease), which are visualized here as bipartite interactions for clarity.
  • FIG.2 shows MultiXVERSE logical flow (figure adapted from [8]). We start with the adjacency matrix of the multilayer network; then, we apply universal random walk with restart to obtain the similarity matrix; and finally we apply the VERSE algorithm to compute the embeddings for diverse types of applications.
  • FIG.3 shows network representation of the progeria cluster. Blue, orange are respectively genes, diseases. Black, light blue, and red links are respectively molecular multiplex, gene-disease, and disease multiplex links [0008]
  • FIGS.4A-4C show network representations of the clusters integrating cancer or developmental disorders and neurotransmitters.
  • FIGS.5A-5E show melanocyte conversion phenotype induced by muscimol exposure.
  • FIG.5A, FIG.5B are taken through the gut
  • FIG.5C, FIG.5D are taken through the mid-tail.
  • FIG.5A Control embryos show small numbers of discrete, round melanocytes (white arrows).
  • FIG.5B In contrast, muscimol exposure induces melanocytes to over-proliferate and form long, stretched out projections that cover the gut cavity and other locations (red arrows).
  • FIG.5C melanocytes in muscimol-treated animals
  • FIG.5D melanocytes in muscimol-treated animals
  • FIG.6 shows an example process to characterize a multilayer network in accordance with some embodiments of the disclosure.
  • FIG.7 shows an example system configured to characterize a multilayer network in accordance with some embodiments of the disclosure. DETAILED DESCRIPTION [0012] In accordance with some embodiments of the disclosed subject matter, methods, systems, and computer readable media for characterizing a multilayer network are provided.
  • a network is made of a set of nodes and edges which connect the nodes. Nodes represent elements of a network, and edges define how the elements interact.
  • a node may be a particular protein, and an edge may determine physical interaction.
  • Edges may be unweighted (e.g., no assigned value or interaction) or weighted (e.g. have an assigned value that influences how an edge interacts with different nodes).
  • a node may also be directed or undirected.
  • a directed edge may include and assigned value which dictates that node A affects node B, but node B does not affect node A or affect node A to a lesser degree.
  • a multilayer network may be used to model systems with different levels of interactions.
  • a multilayer network is a network including two or more layers, each layer including a set of nodes and edges, and edges connecting nodes on the nodes in the layer, as well as the different layers.
  • a visual representation of a multilayer network is shown in FIG.1.
  • a multiplex heterogeneous is a multilayer network that may include monoplex and multiplex networks connected through bipartite interaction networks.
  • a monoplex network refers to a one-layer network composed of one type of nodes and one type of edges.
  • a multiplex network refers to a multilayer network in which each layer is a monoplex network. In some embodiments, all layers in a multiplex network include the same set of nodes, but the edges are in different categories.
  • a bipartite network includes transitions between different types of nodes in different networks (e.g., different multiplex or monoplex networks).
  • a homogenous network refers to a network in which all nodes are the same type of node, and all edges are the same type of edge.
  • a heterogeneous network refers to networks that link different types of nodes through bipartite interactions.
  • a multiplex-heterogeneous multilayer network is a multilayer network made up of any number of monoplex and/or multiplex networks, linked by bipartite networks. The edges in a monoplex/multiplex network may be directed and/or weighted.
  • a multiplex-heterogeneous multilayer network generally includes both networks with unweighted edges, and networks with weighted and/or directed edges. The edges in a bipartite network may similarly be directed and/or weighted.
  • the multiplex-heterogeneous multilayer network includes more than three multiplex heterogenous networks, more than three bipartite networks, and at least two multiplex networks linked by one bipartite.
  • the multiplex-heterogeneous networks comprises a drug- disease-gene networks composed of 3 multiplex networks (drug-drug, disease-disease, and gene- gene multipelx networks) and 3 bipartite (drug-target, drug-disease and disease-gene).
  • Network embedding is a method of encoding high-dimensional network information into feature vectors. Network embedding may be used for a variety of applications, including community detection, node classification, and link prediction. The methods described herein allow network embedding of multilayer networks, including multiplex heterogeneous, without limitation.
  • network embedding includes determining a similarity matrix as a probability distribution (see FIG.2 for a schematic of an example process). Random walk with restart may be used to determine a probability similarity matrix between nodes in a network. Random walk with restart is a computationally efficient way of exploring and optimizing the similarity matrix. Random walk with restart includes parameters that regulate the random walk’s movement across layers, networks, and bipartite connections. [0025] In some embodiments, determining network embedding includes using machine learning approaches to optimize the embeddings. The machine learning approaches can include any suitable approaches, including matrix factorization, hypergraph embeddings, neural networks, or artificial intelligence. In some embodiments, Kullback-Leibler minimization.
  • Embeddings of genes, disease, and drugs can then be used for drug repositioning, finding synergy of drugs, identify new genes associated to a disease, or new biological communities associated to a disease.
  • the embeddings may be used as input for an LLM. Having embeddings opens the complete machine learning toolbox for subsequent analysis.
  • Datasets for Determining Multilayer Networks [0028]
  • a multilayer network may correspond to a dataset that determines the layers, edges, and nodes of the network.
  • the multilayer network used to explore a disease may have three networks: a human molecular network (e.g., a network representing molecular systems and interactions in humans), a drug network (e.g., a network representing drug interactions), and a disease network (e.g., a network representing aspects of disease relationships).
  • the bipartite networks may then determine the interactions between the networks.
  • the datasets may be determined by published literature and/or experimental data. In some embodiments, the characterization of the multilayer network may be compared against experimental data.
  • General Applications [0031] Once the embeddings have been determined, additional machine learning methods may be used to determine further information about the multilayer network.
  • methods can be used for link prediction, clustering, joint embeddings, and other information.
  • the information may be output to a user, e.g., a researcher or a clinician.
  • Machine learning approaches such as clustering approaches, neural networks, random trees, artificial intelligence, or other relevant machine learning approaches, may be used to interpret the embeddings and the information extracted from the embeddings.
  • the methods described herein may be used to determine network embeddings of multilayer, multiplex heterogeneous networks, a long-felt need in the field of modeling complex networks. This is particularly important for modeling biological networks, which are known to be complex and include a variety of different types of interactions.
  • the multiplex-heterogeneous networks described herein include networks including several types of nodes and edges, and linking same types of nodes and different types of nodes. Additionally, the method may include more than one bipartite network. The methods described herein can be applied on any type of multilayer network, whatever the number of multiplex network, and bipartite linking them, on multiplex composed of directed, undirected, weighted, unweighted networks. [0034] Additional Applications [0035] In some embodiments, the multilayer network may correspond to a dataset corresponding to a disease.
  • the datasets may include at least one of a molecular dataset, a drug dataset, or a disease dataset.
  • the network embeddings may be used to identify at least one drug related to a disease.
  • the method may be used to determine new applications for known drugs (e.g., drug repurposing), identify drug interactions, or identify a new drug or drug target to treat a disease.
  • the methods described herein may be used to find new genes associated with a disease. This may be done by clustering the embeddings after the embeddings have been determined.
  • the methods described herein may be used for drug repositioning (e.g., using a known drug for a new purpose, such as treating a different disease).
  • the embeddings may identify several drugs that are associated with a specific disease. In cases such as these, the methods may be used to find drug synergies.
  • the methods described herein may be used to characterize a variety of systems related to specific applications.
  • the characterized networks may be used to study birth defects, regenerative medicine, aging, degenerative disease, cancer, neurological disease, in vitro bioengineering, synthetic food production, and agriculture.
  • Example 1 - Universal multilayer network embedding reveals a causal link between GABA neurotransmitter and cancer
  • Network graph models are highly effective for depicting real-world objects through their relationships and interactions. They offer valuable insights into the connections between different entities and are utilized as tools to investigate complex systems across various fields. A significant challenge in machine learning involves converting high-dimensional graph- based data into a feature vector.
  • Network embedding also known as graph representation learning, addresses this issue by transforming network data into formats compatible with conventional machine learning tools, thereby broadening the scope of machine learning applications in network analysis.
  • Network embedding techniques have proven highly effective across numerous applications, including community detection, node classification, and link prediction. Capable of handling vast networks with millions of nodes, these techniques are particularly valuable in the era of big data. Consequently, network embeddings are increasingly used to analyze various large-scale networks, such as social, neuronal or molecular networks.
  • the volume and complexity of biological data have significantly increased in recent years, often represented as multilayer network models.
  • Multilayer networks which incorporate various nodes and connections in formats such as multiplex, heterogeneous, and bipartite networks, provide an especially effective framework for merging diverse and multi- scale biological data sources.
  • current network embedding methods face challenges and limitations in addressing the heterogeneity and diversity of these networks. Therefore, there is an essential need for the development of new network embedding methods to manage the complexity and diversity of multilayer networks effectively.
  • MultiXVERSE a universal multilayer network embedding method that we named MultiXVERSE. Our method can handle any multilayer network defined as a composition of various multiplex and monoplex networks interconnected through bipartite interaction networks (see FIG.1 for an example multilayer network). Within this multilayer structure, each network may also be weighted and/or directed.
  • MultiXVERSE provides a means to network embedding on these multilayer networks, which are characterized by their rich and complex interactions. This approach is particularly effective in representing the multi-scale interactions typically observed in biological systems. For biology, this approach allows us to aggregate network data from drugs, diseases, genes, patients, etc. in the same network representation and machine learning can be applied on the resulting embeddings for a wide variety of application including drug repositioning, new predicted gene-disease or drug-target links, the discovery of specific biological functional modules for diseases integrating genes and drugs, and more.
  • Several methods have been developed recently in AI and the mining of knowledge graphs.
  • network embedding can be applied to any kind of multilayer networks, including multiplex-heterogeneous networks, without any limitations on the number of multiplex networks or the type of networks (weighted, directed, undirected).
  • MultiXVERSE MultiXVERSE to a biological multilayer network containing data on gene, drug, and disease interactions and evaluated the quality of the embedding using link prediction (a standard approach in multilayer network embedding).
  • link prediction a standard approach in multilayer network embedding.
  • link prediction to the embeddings of GABA agonists drugs and found new links between GABA receptors and cancer.
  • the similarity metric for any given node ⁇ within the multiplex-heterogeneous network ⁇ ⁇ is conceptualized as a probability distribution. Given that, one can obtain the normalized similarity distribution within the embedding space by applying the softmax function. Formally, let ⁇ represent the embedding of node ⁇ within this space. Consequently, the similarity between the embeddings of two nodes ⁇ and ⁇ is characterized by the dot product ⁇ ⁇ ⁇ ⁇ ⁇ , yielding the following expression: [0052] ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ , .
  • NCE Noise Contrastive Estimation
  • ⁇ NCE generates ⁇ negative samples ⁇ from the noise distribution Q(u).
  • the objective function is expressed as the negative log- likelihood, which is minimized using logistic regression: [0061] L ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ 1
  • the similarity function ⁇ i ⁇ ⁇ b ( ⁇ ,.) is computed without normalization. It has been demonstrated that as s increases, the derivative of NCE approaches the gradient of cross-entropy, though in practice, small values are often sufficient.
  • VERSE offers a general framework for network embedding, with the primary requirement that simG be a probability distribution.
  • the similarity simG in the multiplex-heterogenous network is computed using MultiXrank.
  • MultiXVERSE applies Kullback-Leibler minimization to optimize the embeddings.
  • the parameters ⁇ , ⁇ , ⁇ 1 , ⁇ 2 , ⁇ 3 , and ⁇ in the MultiXrank framework define different probabilities governing transitions in the random walk with restart (RWR) on universal multilayer networks.
  • the parameter ⁇ represents the probability of restarting the random walk within a given multiplex network, ensuring that the walker does not transition to another multiplex network at every step. The sum of all ⁇ values across multiplex networks is constrained to equal 1. We assigned the same probability of restarting in each multiplex.
  • the parameter ⁇ controls the probability of jumping between multiplex networks, allowing transitions across bipartite connections and ensuring that the walk spreads across the universal multilayer structure. We assigned the same probability of jumping in each multiplex.
  • the parameters ⁇ 1 , ⁇ 2 , and ⁇ 3 define the probability of moving between layers within the same multiplex network, thereby capturing layer-specific transition dynamics.
  • the random walker explores several multiplex networks, the probability is defined as 0.5 for each multiplex; therefore, the random walker has the same probability to jump from one layer to another as to stay in the layer.
  • the parameter ⁇ governs the probability of restarting the random walk at a specific layer within a multiplex network, ensuring a balanced exploration of different layers.
  • the sum of all ⁇ values within a multiplex network must also sum to 1.
  • has been defined to assign the same probability of restarting in one layer of a specific multiplex network.
  • RWR and NCE are known to be fast and efficient methods; the computational complexity is very suitable for nodes composed of millions of nodes as demonstrated previously.
  • the main computational limitation is likely to be the RWR part, as time complexity depends on the number of edges.
  • the general logical flow of the method can be found in FIG.2.
  • the multiplex-heterogeneous network was composed of one human molecular multiplex network (3 layers), one drug multiplex network (4 layers) and one disease monoplex network.
  • the multiplex networks are linked by 3 bipartite networks: drug-disease, gene-disease, and drug-target networks.
  • the multiplex networks are the following: [0072] Human molecular multiplex network: This network is a molecular network, extracted from [8], composed of three layers.1.
  • the first layer is a protein-protein interaction (PPI) layer which integrates 4 datasets: Hi-Union, APID (apid.dep.usal.es) (Level 2, human only), Lit-BM.2.
  • the second layer is a pathways layer constructed from the human reactome data extracted from NDEx [33].3.
  • the third layer is a molecular complex layer corresponding to the fusion of Hu.ap and Corum.
  • Drug multiplex network The multiplex drug network integrates several sources and interaction types and has been extracted from Baptista et al., Commun Phys.2022; 5(1):170.
  • drugs are named according to DrugBank conventions, encompassing both the multiplex network and its associated bipartite networks.1.
  • the first layer corresponds to clinical drug interactions. It includes 14,822 clinically reported adverse drug-drug interactions among 667 drugs.2.
  • the second is the experimental drug combinations layer. It contains 737 experimentally validated drug combinations involving 376 drugs.3.
  • the third represents the predicted drug combinations and includes 2,080 network-predicted combinations for hypertensive drugs, covering 65 different drugs.4.
  • the last layer of the drug multiplex network includes the pharmacologic drug-drug interactions and consists of 48,514 interactions determined by the pharmacological effects of one drug on another, involving 1,514 drugs.
  • Disease network The disease multiplex network (DIS) has been structured into two layers, each representing different aspects of disease relationships.1.
  • Disease-Disease Network Based on Shared Symptoms Originating from a bipartite disease-symptoms network from Zhoue et al, Nat Commun.2014;5(1):1-10, this layer forms connections based on the cosine distance between diseases, retaining all interactions where this distance is above 0.5, indicating significant symptom overlap.2.
  • Comorbidity Network This layer integrates epidemiological data from Jensen et al., Nat Commun 2014;5(1)1-10 to illustrate the comorbidity relationships among diseases, highlighting epidemiological correlations. [0075] Each layer provides a unique perspective on disease interactions, encompassing treatment similarities, symptom relationships, and epidemiological data. [0076]
  • the bipartite networks are: [0077] Gene-Disease Network: We extracted the curated gene-disease bipartite network from the DisGeNET database in order to connect the two molecular and disease multiplex networks.
  • Drug-target Network This network combines multiple sources including DrugBank Release Version 5.1.8, DrugCentral release v10.12, and associations described in other publications.
  • the binary classifier’s training method included the utilization of various operators on the node embeddings. These operators comprised Hadamard, Weighted- L1, Weighted-L2, Average, and cosine.
  • the objective of this validation method was to ascertain the quality of the embeddings in the discovery of novel drug-gene-disease associations. At present, conducting direct comparative analyses with alternative methodologies is not practicable due to the unique nature of the embedding process for multilayer networks with three distinct node types from 3 different multiplex networks, a feature not yet paralleled in the existing literature.
  • Case study on cancer and neurotransmitters [0085] The second approach we used for validation was to test the method on a case study, here to assess the link between neurotransmitter and cancer.
  • Serotonin has already been linked to cancer and we wanted to know if other neurotransmitters might be predicted by our model which could lead to the discovery of new targets and drug repositioning for cancer.
  • We focused on biological modules. Once MultiXVERSE had been applied to the drug-disease-gene multilayer network, we used a clustering method on the embeddings and analyzed the clusters. The clustering method we applied was spherical k-means with ⁇ 500 applied on the embedding. [0087] We then applied link prediction to GABA agonist drugs using a Random Forest classifier with the operator ’Average’.
  • Xenopus Laevis embryos were fertilized in vitro according to standard protocols, from eggs obtained from the adult frogs living in our Xenopus facility.
  • Drug Exposure [0092] Stocks of muscimol (Tocris 0289) were kept at 10 mM concentration in DMSO. Embryos were exposed in 0.1X MMR during stages 12-43 in muscimol at a final concentration of 50 ⁇ M. [0093] Histology [0094] Embryos at stage 43-45 were embedded in JB4 according to the manufacturer’s directions (Polysciences) and sectioned on a Leica microtome at 20 ⁇ . They were then photographed on a Nikon SMZ-1500 microscope.
  • Link predictions are computer for the bipartite interactions of multiplex-heterogenous networks. We applied our evaluation protocol 10 times and found ROC-AUC superior to 0.9 with Average operators for all bipartite networks. The scores higher than 0.9 are highlighted in bold. [0100] The variance across all operators is minimal, indicating that the network embedding method demonstrates high robustness and consistency in each iteration of the link prediction evaluation test. [0101] Consistency of MultiXVERSE with MultiVERSE Results on the Progeria Cluster [0102] To assess the quality of the clustering of our embeddings, we analyzed the progeria cluster (see FIG.3). Hutchinson-Gilford Progeria Syndrome (HGPS) is a rare genetic disorder that causes premature aging.
  • HGPS Hutchinson-Gilford Progeria Syndrome
  • HGPS arises from mutations in the LMNA genes, leading to the production of a deleterious version of the Lamin A protein, known as Progerin.
  • the results are similar to those obtained with MultiVERSE: LMNA and HGPS were both found to be associated.
  • ZMPSTE24 was found to be associated.
  • KCNK13 is an especially interesting gene and encodes a K+ potassium ion channel (Potassium Two Pore Domain Channel Subfamily K Member 13). It is related to the Birk-Barel syndrome (BIBARS) - a rare genetic disorder characterized by motor and speech delay, impaired intellectual development, early feeding difficulties, muscular hypotonia, hyperactivity, aggression, and facial dysmorphism. This syndrome shares part of its phenotype with HGPS. HGPS has also been related to bioelectricity which can fall under the context of aging (or premature aging) as a channelopathy.
  • BIBARS Birk-Barel syndrome
  • One cluster includes EPO and Darbepoetin alpha. Recombinant human erythropoietin is commonly used in clinical settings to treat anemia associated with cancer and chemotherapy. However, recent clinical trials indicate that rhEPO might also negatively affect disease progression and patient survival. Interestingly, EPO is known to increase GABA currents suggesting an implication of GABA neurotransmitter in the adverse effect of EPO in cancer development.
  • a second cluster is linking developmental disorders, neurotransmitter drugs (gabapentin with lamotrigine) with serotonin syndrome and large cell carcinomas (see Fig.4B).
  • Gabapentin is a structural analogue of the inhibitory neurotransmitter gamma-aminobutyric acid (GABA).
  • GABA inhibitory neurotransmitter gamma-aminobutyric acid
  • Lamotrigine is an anti-glutamate agent and may enhance GABAergic transmission. Lamotrigine can also augment serotonin re-uptake inhibitors.
  • This cluster suggests a link between GABA, serotonin and cancer, and that has been recently studied.
  • FIG.4C shows a cluster including developmental disorders and Trimetazidine, which is an anti-ischemic drug that can inhibit platelet aggregation and regulate the expression of serotonin in a rodent model.
  • muscimol a well-known GABA agonist. Muscimol itself is not in the original data we used in our model but it is a GABA(A) agonist like Progabide [79, 80] that was found in the link prediction (see above), enabling us to test the utility of the model’s categorical predictions for novel drugs that it did not have direct experience with. Fifty Xenopus embryos, in triplicate, were exposed to 50 ⁇ M muscimol between stage 12 to stage 45 (after completion of gastrulation through swimming tadpole stages), and then sectioned.
  • MultiXVERSE By providing a universal, scalable framework for multilayer network embedding, MultiXVERSE enables the systematic exploration of molecular and phenotypic interactions across diverse biological contexts. Our experimental validation of the predicted link between GABA and cancer using Xenopus laevis underscores its capability to generate biologically meaningful hypotheses and accelerate breakthroughs in multi-omics research. [0126] Future directions include applying MultiXVERSE to additional multi-omics datasets and integrating it with high-throughput experimental pipelines for systematic hypothesis generation and validation, particularly in drug discovery. Beyond its biological applications, MultiXVERSE is a versatile tool that can be utilized for analyzing multilayer networks in a wide range of fields, including social sciences and other complex systems.
  • Erythropoietin increases gabaa currents in human cortex from tle patients. Neuroscience.2020;439:153–62. [0180] 53.Rose M, Kam P. Gabapentin: pharmacology and its use in pain management. Anaesthesia.2002;57(5):451–62. [0181] 54.Costa B, Vale N. Understanding lamotrigine’s role in the cns and possible future evolution. Int J Mol Sci.2023;24(7):6050. [0182] 55.Reid JG, Gitlin MJ, Altshuler LL. Lamotrigine in psychiatric disorders. J Clin Psychiatry.2013;74(7):675–84.
  • FIG.6 shows an example process 600 to characterize a multilayer network.
  • a network is obtained.
  • the network may be a multilayer network made of a plurality of multiplex heterogeneous networks and a plurality of bipartite networks connecting the multiplex heterogeneous networks in the plurality of multiple heterogeneous networks.
  • each multiple heterogeneous network in the plurality of multiple heterogeneous networks includes a plurality of nodes.
  • a trained machine learning algorithm may be applied to relate the plurality of multiplex heterogeneous networks to an embedding space.
  • the trained machine learning algorithm may use random walk with restart.
  • the embedding space of the plurality of multiplex heterogeneous networks can be used to extract information to corresponding to interactions between the plurality of nodes of each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks.
  • the embedding space and extracted information may be output. In some embodiments, the information may be output to a user.
  • communication network 702 can be any suitable communication network or combination of communication networks.
  • communication network 702 can include a Wi-Fi network (which can include one or more wireless routers, one or more switches, etc.), a peer-to- peer network (e.g., a Bluetooth network), a cellular network (e.g., a 4G network, a 5G network, etc., complying with any suitable standard, such as CDMA, GSM, LTE, LTE Advanced, WiMAX, etc.), a wired network, etc.
  • Wi-Fi network which can include one or more wireless routers, one or more switches, etc.
  • peer-to- peer network e.g., a Bluetooth network
  • a cellular network e.g., a 4G network, a 5G network, etc., complying with any suitable standard, such as CDMA, GSM, LTE, LTE Advanced, WiMAX, etc.
  • a wired network etc.
  • communication systems 712 can include any suitable hardware, firmware, and/or software for communicating information over communication network 702 and/or any other suitable communication networks.
  • communications systems 712 can include one or more transceivers, one or more communication chips and/or chip sets, etc.
  • communications systems 712 can include hardware, firmware and/or software that can be used to establish a Wi-Fi connection, a Bluetooth connection, a cellular connection, an Ethernet connection, etc.
  • memory 714 can include any suitable storage device or devices that can be used to store instructions, values, etc., that can be used, for example, by processor 706 to present content using display 708, to communicate with server 716 via communications system(s) 712, etc.
  • Memory 714 can include any suitable volatile memory, non-volatile memory, storage, or any suitable combination thereof.
  • memory 714 can include RAM, ROM, EEPROM, one or more flash drives, one or more hard disks, one or more solid state drives, one or more optical drives, etc.
  • memory 714 can have encoded thereon a computer program for controlling operation of computing device 704.
  • processor 706 can execute at least a portion of the computer program to present content (e.g., images, user interfaces, graphics, tables, etc.), receive content from server 716, transmit information to server 716, etc.
  • server 716 can include a processor 718, a display 720, one or more inputs 722, one or more communications systems 724, and/or memory 726.
  • processor 718 can be any suitable hardware processor or combination of processors, such as a central processing unit, a graphics processing unit, etc.
  • display 720 can include any suitable display devices, such as a computer monitor, a touchscreen, a television, etc.
  • inputs 722 can include any suitable input devices and/or sensors that can be used to receive user input, such as a keyboard, a mouse, a touchscreen, a microphone, etc.
  • communications systems 724 can include any suitable hardware, firmware, and/or software for communicating information over communication network 702 and/or any other suitable communication networks.
  • communications systems 724 can include one or more transceivers, one or more communication chips and/or chip sets, etc.
  • communications systems 724 can include hardware, firmware and/or software that can be used to establish a Wi-Fi connection, a Bluetooth connection, a cellular connection, an Ethernet connection, etc.
  • memory 726 can include any suitable storage device or devices that can be used to store instructions, values, etc., that can be used, for example, by processor 718 to present content using display 720, to communicate with one or more computing devices 704, etc.
  • Memory 726 can include any suitable volatile memory, non-volatile memory, storage, or any suitable combination thereof.
  • memory 726 can include RAM, ROM, EEPROM, one or more flash drives, one or more hard disks, one or more solid state drives, one or more optical drives, etc.
  • memory 726 can have encoded thereon a server program for controlling operation of server 716.
  • processor 718 can execute at least a portion of the server program to transmit information and/or content (e.g., results of a tissue identification and/or classification, a user interface, etc.) to one or more computing devices 704, receive information and/or content from one or more computing devices 704, receive instructions from one or more devices (e.g., a personal computer, a laptop computer, a tablet computer, a smartphone, etc.), etc.
  • information and/or content e.g., results of a tissue identification and/or classification, a user interface, etc.
  • processor 718 can execute at least a portion of the server program to transmit information and/or content (e.g., results of a tissue identification and/or classification, a user interface, etc.) to one or more computing devices 704, receive information and/or content from one or more computing devices 704, receive instructions from one or more devices (e.g., a personal computer, a laptop computer, a tablet computer, a smartphone, etc.), etc.
  • any suitable computer readable media can be used
  • non-transitory computer readable media can include media such as magnetic media (such as hard disks, floppy disks, etc.), optical media (such as compact discs, digital video discs, Blu-ray discs, etc.), semiconductor media (such as RAM, Flash memory, electrically programmable read only memory (EPROM), electrically erasable programmable read only memory (EEPROM), etc.), any suitable media that is not fleeting or devoid of any semblance of permanence during transmission, and/or any suitable tangible media.
  • media such as magnetic media (such as hard disks, floppy disks, etc.), optical media (such as compact discs, digital video discs, Blu-ray discs, etc.), semiconductor media (such as RAM, Flash memory, electrically programmable read only memory (EPROM), electrically erasable programmable read only memory (EEPROM), etc.), any suitable media that is not fleeting or devoid of any semblance of permanence during transmission, and/or any suitable tangible media.
  • EPROM electrically programmable read only
  • transitory computer readable media can include signals on networks, in wires, conductors, optical fibers, circuits, or any suitable media that is fleeting and devoid of any semblance of permanence during transmission, and/or any suitable intangible media.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Algebra (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Systems and methods for characterizing a multilayer network, including: obtaining a multilayer network including a plurality of multiplex heterogenous networks and a plurality of bipartite networks connecting the multiplex heterogeneous networks in the plurality of multiplex heterogeneous networks, in which each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks includes a plurality of nodes; applying a trained machine learning algorithm to relate the plurality of multiplex heterogeneous networks to an embedding space, in which the trained machine learning algorithm uses random walk with restart; using the embedding space of the plurality of multiplex heterogeneous networks to extract information corresponding to interactions between the plurality of nodes of each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks; and outputting the embedding space and the extracted information to a user.

Description

METHODS AND SYSTEMS FOR MODELING MULTILAYER NETWORKS CROSS-REFERENCE TO RELATED APPLICATIONS [0001] This application claims priority to U.S. Provisional Application No.63/665,915 filed June 28, 2024. The contents of which is herein incorporated by reference in its entirety. BACKGROUND [0002] The volume and complexity of biological data have significantly increased in recent years, often represented as network models continue to increase at a rapid pace. However, drug discovery in the context of complex phenotypes is hampered by the difficulties inherent in producing machine learning algorithms that can integrate molecular-genetic, biochemical, physiological, and other diverse datasets. Recent developments have expanded network analysis techniques, such as network embedding, to effectively explore multilayer network structures. Multilayer networks, which incorporate various nodes and connections in formats such as multiplex, heterogeneous, and bipartite networks, provide an effective framework for merging diverse and multi-scale biological data sources. However, current network embedding methods face challenges and limitations in addressing the heterogeneity and diversity of these networks. Therefore, there is an essential need for the development of new network embedding methods to manage the complexity and diversity of multi-omics biological information effectively. SUMMARY [0003] Disclosed herein are methods, systems, and computer readable media for characterizing a multilayer network. In one embodiment, the method includes: obtaining a multilayer network including a plurality of multiplex heterogenous networks and a plurality of bipartite networks connecting the multiplex heterogeneous networks in the plurality of multiplex heterogeneous networks, in which each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks includes a plurality of nodes; applying a trained machine learning algorithm to relate the plurality of multiplex heterogeneous networks to an embedding space, in which the trained machine learning algorithm uses random walk with restart; using the embedding space of the plurality of multiplex heterogeneous networks to extract information corresponding to interactions between the plurality of nodes of each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks; and outputting the embedding space and the extracted information to a user. BRIEF DESCRIPTION OF THE DRAWINGS [0004] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings(s) will be provided by the Office upon request and payment of the necessary fee. [0005] FIG.1 shows a universal multilayer network consisting of 3 distinct multiplex networks (gene, drug and disease), each represented by different colors (yellow, purple, and blue). Each of these multiplex networks consists of various types of nodes: squares are genes, circles are drugs and triangles are diseases. They are interconnected through three bipartite networks (gene-disease, drug-target and drug-disease), which are visualized here as bipartite interactions for clarity. The number of multiplex networks and layers inside the multiplex networks is arbitrary and could be more [0006] FIG.2 shows MultiXVERSE logical flow (figure adapted from [8]). We start with the adjacency matrix of the multilayer network; then, we apply universal random walk with restart to obtain the similarity matrix; and finally we apply the VERSE algorithm to compute the embeddings for diverse types of applications. [0007] FIG.3 shows network representation of the progeria cluster. Blue, orange are respectively genes, diseases. Black, light blue, and red links are respectively molecular multiplex, gene-disease, and disease multiplex links [0008] FIGS.4A-4C show network representations of the clusters integrating cancer or developmental disorders and neurotransmitters. (FIG.4A) Network cluster linking drugs regulating GABA and melanoma. (FIG.4B) Network cluster linking GABA drugs like gabapentin and large cell carcinoma. (FIG.4C) Network cluster linking trimetazidine and malformations of cortex development. Blue, orange and green boxes are respectively genes, diseases and drugs. Black, light blue, yellow, light green and red links are respectively molecular multiplex, gene-disease, drug-target, drug-disease, and disease multiplex links. [0009] FIGS.5A-5E show melanocyte conversion phenotype induced by muscimol exposure. All panels are transverse sections; (FIG.5A, FIG.5B) are taken through the gut, while (FIG.5C, FIG.5D) are taken through the mid-tail. (FIG.5A) Control embryos show small numbers of discrete, round melanocytes (white arrows). (FIG.5B) In contrast, muscimol exposure induces melanocytes to over-proliferate and form long, stretched out projections that cover the gut cavity and other locations (red arrows). Compared to round melanocytes in the trunk and tail of controls (FIG.5C), melanocytes in muscimol-treated animals (FIG.5D) can be clearly seen to have an abnormal invasive shape and distribution (FIG.5D). (FIG.5E) Closeup of melanocytes invading the neural tube after muscimol exposure. Nt = neural tube. [0010] FIG.6 shows an example process to characterize a multilayer network in accordance with some embodiments of the disclosure. [0011] FIG.7 shows an example system configured to characterize a multilayer network in accordance with some embodiments of the disclosure. DETAILED DESCRIPTION [0012] In accordance with some embodiments of the disclosed subject matter, methods, systems, and computer readable media for characterizing a multilayer network are provided. [0013] Networks [0014] A network is made of a set of nodes and edges which connect the nodes. Nodes represent elements of a network, and edges define how the elements interact. For instance, in a protein-protein network, a node may be a particular protein, and an edge may determine physical interaction. [0015] Edges may be unweighted (e.g., no assigned value or interaction) or weighted (e.g. have an assigned value that influences how an edge interacts with different nodes). A node may also be directed or undirected. A directed edge may include and assigned value which dictates that node A affects node B, but node B does not affect node A or affect node A to a lesser degree. [0016] A multilayer network may be used to model systems with different levels of interactions. One interpretation of a multilayer network is a network including two or more layers, each layer including a set of nodes and edges, and edges connecting nodes on the nodes in the layer, as well as the different layers. A visual representation of a multilayer network is shown in FIG.1. A multiplex heterogeneous is a multilayer network that may include monoplex and multiplex networks connected through bipartite interaction networks. A monoplex network refers to a one-layer network composed of one type of nodes and one type of edges. A multiplex network refers to a multilayer network in which each layer is a monoplex network. In some embodiments, all layers in a multiplex network include the same set of nodes, but the edges are in different categories. A bipartite network includes transitions between different types of nodes in different networks (e.g., different multiplex or monoplex networks). [0017] A homogenous network refers to a network in which all nodes are the same type of node, and all edges are the same type of edge. A heterogeneous network refers to networks that link different types of nodes through bipartite interactions. [0018] A multiplex-heterogeneous multilayer network is a multilayer network made up of any number of monoplex and/or multiplex networks, linked by bipartite networks. The edges in a monoplex/multiplex network may be directed and/or weighted. A multiplex-heterogeneous multilayer network generally includes both networks with unweighted edges, and networks with weighted and/or directed edges. The edges in a bipartite network may similarly be directed and/or weighted. [0019] In some embodiments, the multiplex-heterogeneous multilayer network includes more than three multiplex heterogenous networks, more than three bipartite networks, and at least two multiplex networks linked by one bipartite. [0020] In some embodiments, the multiplex-heterogeneous networks comprises a drug- disease-gene networks composed of 3 multiplex networks (drug-drug, disease-disease, and gene- gene multipelx networks) and 3 bipartite (drug-target, drug-disease and disease-gene). The number of layers in the different multiplex networks will depend on the quality of the data. In some embodiments, the network may be used for drug discovery. [0021] In previous methods, such as MultiVERSE, such a network could not be embedded as it was limited to 2 multiplex networks and one bipartite network. In other previous work, such as MultiXrank, the methods could only apply RWR on this network and was incapable of any embeddings. [0022] Network Embedding [0023] Network embedding is a method of encoding high-dimensional network information into feature vectors. Network embedding may be used for a variety of applications, including community detection, node classification, and link prediction. The methods described herein allow network embedding of multilayer networks, including multiplex heterogeneous, without limitation. [0024] In some embodiments, network embedding includes determining a similarity matrix as a probability distribution (see FIG.2 for a schematic of an example process). Random walk with restart may be used to determine a probability similarity matrix between nodes in a network. Random walk with restart is a computationally efficient way of exploring and optimizing the similarity matrix. Random walk with restart includes parameters that regulate the random walk’s movement across layers, networks, and bipartite connections. [0025] In some embodiments, determining network embedding includes using machine learning approaches to optimize the embeddings. The machine learning approaches can include any suitable approaches, including matrix factorization, hypergraph embeddings, neural networks, or artificial intelligence. In some embodiments, Kullback-Leibler minimization. [0026] Embeddings of genes, disease, and drugs can then be used for drug repositioning, finding synergy of drugs, identify new genes associated to a disease, or new biological communities associated to a disease. In some embodiments, the embeddings may be used as input for an LLM. Having embeddings opens the complete machine learning toolbox for subsequent analysis. [0027] Datasets for Determining Multilayer Networks [0028] In some embodiments, a multilayer network may correspond to a dataset that determines the layers, edges, and nodes of the network. For example, the multilayer network used to explore a disease may have three networks: a human molecular network (e.g., a network representing molecular systems and interactions in humans), a drug network (e.g., a network representing drug interactions), and a disease network (e.g., a network representing aspects of disease relationships). The bipartite networks may then determine the interactions between the networks. [0029] The datasets may be determined by published literature and/or experimental data. In some embodiments, the characterization of the multilayer network may be compared against experimental data. [0030] General Applications [0031] Once the embeddings have been determined, additional machine learning methods may be used to determine further information about the multilayer network. For instance, methods can be used for link prediction, clustering, joint embeddings, and other information. The information may be output to a user, e.g., a researcher or a clinician. Machine learning approaches, such as clustering approaches, neural networks, random trees, artificial intelligence, or other relevant machine learning approaches, may be used to interpret the embeddings and the information extracted from the embeddings. [0032] Advantages [0033] The methods described herein may be used to determine network embeddings of multilayer, multiplex heterogeneous networks, a long-felt need in the field of modeling complex networks. This is particularly important for modeling biological networks, which are known to be complex and include a variety of different types of interactions. The multiplex-heterogeneous networks described herein include networks including several types of nodes and edges, and linking same types of nodes and different types of nodes. Additionally, the method may include more than one bipartite network. The methods described herein can be applied on any type of multilayer network, whatever the number of multiplex network, and bipartite linking them, on multiplex composed of directed, undirected, weighted, unweighted networks. [0034] Additional Applications [0035] In some embodiments, the multilayer network may correspond to a dataset corresponding to a disease. The datasets may include at least one of a molecular dataset, a drug dataset, or a disease dataset. The network embeddings may be used to identify at least one drug related to a disease. The method may be used to determine new applications for known drugs (e.g., drug repurposing), identify drug interactions, or identify a new drug or drug target to treat a disease. [0036] In some embodiments, the methods described herein may be used to find new genes associated with a disease. This may be done by clustering the embeddings after the embeddings have been determined. [0037] In some embodiments, the methods described herein may be used for drug repositioning (e.g., using a known drug for a new purpose, such as treating a different disease). In some embodiments, the embeddings may identify several drugs that are associated with a specific disease. In cases such as these, the methods may be used to find drug synergies. [0038] The methods described herein may be used to characterize a variety of systems related to specific applications. For instance, the characterized networks may be used to study birth defects, regenerative medicine, aging, degenerative disease, cancer, neurological disease, in vitro bioengineering, synthetic food production, and agriculture. [0039] Examples [0040] Example 1 - Universal multilayer network embedding reveals a causal link between GABA neurotransmitter and cancer [0041] Network graph models are highly effective for depicting real-world objects through their relationships and interactions. They offer valuable insights into the connections between different entities and are utilized as tools to investigate complex systems across various fields. A significant challenge in machine learning involves converting high-dimensional graph- based data into a feature vector. Indeed, these methods were originally designed for vector data and cannot be directly applied to biological datasets such as biological networks. Network embedding, also known as graph representation learning, addresses this issue by transforming network data into formats compatible with conventional machine learning tools, thereby broadening the scope of machine learning applications in network analysis. [0042] Network embedding techniques have proven highly effective across numerous applications, including community detection, node classification, and link prediction. Capable of handling vast networks with millions of nodes, these techniques are particularly valuable in the era of big data. Consequently, network embeddings are increasingly used to analyze various large-scale networks, such as social, neuronal or molecular networks. [0043] The volume and complexity of biological data have significantly increased in recent years, often represented as multilayer network models. Multilayer networks, which incorporate various nodes and connections in formats such as multiplex, heterogeneous, and bipartite networks, provide an especially effective framework for merging diverse and multi- scale biological data sources. However, current network embedding methods face challenges and limitations in addressing the heterogeneity and diversity of these networks. Therefore, there is an essential need for the development of new network embedding methods to manage the complexity and diversity of multilayer networks effectively. [0044] We developed a universal multilayer network embedding method that we named MultiXVERSE. Our method can handle any multilayer network defined as a composition of various multiplex and monoplex networks interconnected through bipartite interaction networks (see FIG.1 for an example multilayer network). Within this multilayer structure, each network may also be weighted and/or directed. And we can add as many multiplex and bipartite networks as we want with this extension without limitations, except of course computational power. [0045] Consequently, MultiXVERSE provides a means to network embedding on these multilayer networks, which are characterized by their rich and complex interactions. This approach is particularly effective in representing the multi-scale interactions typically observed in biological systems. For biology, this approach allows us to aggregate network data from drugs, diseases, genes, patients, etc. in the same network representation and machine learning can be applied on the resulting embeddings for a wide variety of application including drug repositioning, new predicted gene-disease or drug-target links, the discovery of specific biological functional modules for diseases integrating genes and drugs, and more. Several methods have been developed recently in AI and the mining of knowledge graphs. To the best of our knowledge, it is the first time that network embedding can be applied to any kind of multilayer networks, including multiplex-heterogeneous networks, without any limitations on the number of multiplex networks or the type of networks (weighted, directed, undirected). [0046] In this article, we applied MultiXVERSE to a biological multilayer network containing data on gene, drug, and disease interactions and evaluated the quality of the embedding using link prediction (a standard approach in multilayer network embedding). Second, we clustered the embeddings to find functional biological modules, which revealed new predictions of a link between GABA and cancer. Third, we applied link prediction to the embeddings of GABA agonists drugs and found new links between GABA receptors and cancer. Neurotransmitters are emerging targets in cancer and as such provide a case study particularly suited for evaluating our system. Finally, we experimentally tested the new prediction of GABA as a potential cause of cancer in a tadpole melanocyte model and validated the prediction linking GABA-modulating drugs to a cancer-like phenotype in the absence of classic carcinogens, oncogenes, or DNA damage as the initiator of cellular conversion. [0047] We attempted to find similar methods in the literature, but unfortunately without success. Existing methods, such as PMNE and MNE (Zhang, H., et al. IJCAI.2018; 18:3082-8), MVE (Qu M., et al. Proceedings of the 2017 ACM on Information and Knowledge Management, pp.1767-1776, 2017), mvn2vec (Shi H. et al. arXiv preprint, 2018), DualHGNN (Liao, J., et al. IJCNN, pp.1-10, IEEEE, 2023), even though described sometimes under the category of multiplex heterogeneous networks, are only heterogeneous in the sense that they allow several types of edges linking the nodes; they can embed multiplex networks but can’t embed multiplex- heterogeneous networks such as ours, e.g., composed of several types of nodes and edges linking same type of nodes and different types of nodes, and several bipartites. Also, most of the methods above rely on the manual construction of meta-paths, which limits their generality, which is not a limitation in the case of MultiXVERSE. [0048] Materials and Methods [0049] Within the MultiXVERSE framework, it is necessary to formulate a similarity metric for the multiplex-heterogeneous network, designated as ^^^^^^. This metric, denoted ^^^^^^ீ: ^^ெு ൈ ^^ெு → ℝ maps pairs of nodes within ^^^^^^ to a real number, reflecting their level of similarity. It is defined as follows: [0050] ∀௩∈ ^^ெு ,∑௩∈^ ^^^^^^ீ^^^, ^^^ ൌ 1 . the similarity metric for any given node ^^ within the multiplex-heterogeneous network ^^^^^^, is conceptualized as a probability distribution. Given that, one can obtain the normalized similarity distribution within the embedding space by applying the softmax function. Formally, let ^^^^ represent the embedding of node ^^ within this space. Consequently, the similarity between the embeddings of two nodes ^^^^ and ^^^^ is characterized by the dot product ^^ ் ௨ ⋅ ^^௩ , yielding the following expression: [0052] ^^^^^^ ^ ^ ^^୮ ^௪⋅௪^^ ா^^ ^^, . ൌ ∑ ^ ^సభ ௪௫^^௪ೡ⋅௪^^ closely estimate the similarity distribution within the embedding space, represented as ^^^^^^ீ: ^^ெு ൈ ^^ெு → ℝ , such that for all ^^ in ^^^^^^, the relationship ^^^^^^ீ^^^, . ^ is approximated by ^^^^^^ா^^^^^, . ^ .The optimization during the learning phase is executed through the minimization of the Kullback-Leibler divergence between the two similarity measures: [0054] ∑௩∈^ಾ ^^^^^^^^^^^ீ^^^, . ^ ∥ ^^^^^^ா^^^^^, . ^^ [0055] By keeping only the terms related to simEmb as simG as constant, we derive the objection function as follows: [0056] ℒ ൌ െ∑௩∈^ಾ ^^^^^^ீ^^^, . ^^^^^^^^^^^^^^^^ா^^^^^, . ^^ as a softmax function, it necessitates normalization across the entire network’s nodes, a process that is computationally intensive. Noise Contrastive Estimation (NCE) is utilized to approximate these computations. [0058] NCE trains a binary classifier to differentiate between node pairs sampled from the graph similarity distribution ^^i^^^^ and those obtained from a noise distribution Q. [0059] We define D as a random variable denoting class labels, where ^^=0 if a node is sampled from the noise distribution Q, and ^^=1 if it is drawn from the empirical distribution. The expected value operator is denoted by ^^. Given a node u sampled from ^^ and another node v drawn from ^^^^^^ீ^^^, . ^ NCE generates ^^<^^ negative samples ^^^^^^^^ from the noise distribution Q(u). [0060] Under this formulation, the objective function is expressed as the negative log- likelihood, which is minimized using logistic regression: [0061] ℒே^ா ൌ ∑ ௨~℘ ^^^^^^^^^^^^^ ൌ 1|^^^^^^ா^^^^^, ^^^^ ^ ^^.^^௩ ൌ [0062] where ^^^^ is computed using the sigmoid function ^^^^^^ ൌ ^1 ^ ^^ି௫^ି^ applied to the dot product between embeddings ^^^^ and ^^^^. The similarity function ^^i^^^^^^b(^^,.) is computed without normalization. It has been demonstrated that as s increases, the derivative of NCE approaches the gradient of cross-entropy, though in practice, small values are often sufficient. Consequently, this approach effectively minimizes the KL-divergence between simG and its learned representation. [0063] In summary, VERSE offers a general framework for network embedding, with the primary requirement that simG be a probability distribution. In this framework, the similarity simG in the multiplex-heterogenous network is computed using MultiXrank. And MultiXVERSE applies Kullback-Leibler minimization to optimize the embeddings. [0064] The parameters for random walks with restart on the multilayer networks are the ^ ^ ^ ଷ ଷ ଷ ^ ^ ^ ^ ^ By default, we set this parameter to 0.7, which is a value often used in the literature. The parameters ^^, ^^, ^^1, ^^2, ^^3, and ^^ in the MultiXrank framework define different probabilities governing transitions in the random walk with restart (RWR) on universal multilayer networks. [0065] The parameter ^^ represents the probability of restarting the random walk within a given multiplex network, ensuring that the walker does not transition to another multiplex network at every step. The sum of all ^^ values across multiplex networks is constrained to equal 1. We assigned the same probability of restarting in each multiplex. The parameter ^^ controls the probability of jumping between multiplex networks, allowing transitions across bipartite connections and ensuring that the walk spreads across the universal multilayer structure. We assigned the same probability of jumping in each multiplex. [0066] The parameters ^^1, ^^2, and ^^3 define the probability of moving between layers within the same multiplex network, thereby capturing layer-specific transition dynamics. In our case, the random walker explores several multiplex networks, the probability is defined as 0.5 for each multiplex; therefore, the random walker has the same probability to jump from one layer to another as to stay in the layer. The parameter ^^ governs the probability of restarting the random walk at a specific layer within a multiplex network, ensuring a balanced exploration of different layers. The sum of all ^^ values within a multiplex network must also sum to 1. Here, ^^ has been defined to assign the same probability of restarting in one layer of a specific multiplex network. [0067] These parameters together regulate the random walk’s movement across layers, multiplex networks, and bipartite connections, ensuring a structured and controlled exploration of the multiplex-heterogeneous network. [0068] RWR and NCE are known to be fast and efficient methods; the computational complexity is very suitable for nodes composed of millions of nodes as demonstrated previously. The main computational limitation is likely to be the RWR part, as time complexity depends on the number of edges. However, it has been demonstrated to be very practical for large networks with millions of nodes, something we don’t often have in biological datasets. The general logical flow of the method can be found in FIG.2. [0069] Datasets for Gene-Drug-Disease Multilayer Network [0070] We used several different datasets to construct the multiplex-heterogeneous network, which was composed of one human molecular multiplex network (3 layers), one drug multiplex network (4 layers) and one disease monoplex network. The multiplex networks are linked by 3 bipartite networks: drug-disease, gene-disease, and drug-target networks. [0071] The multiplex networks are the following: [0072] Human molecular multiplex network: This network is a molecular network, extracted from [8], composed of three layers.1. The first layer is a protein-protein interaction (PPI) layer which integrates 4 datasets: Hi-Union, APID (apid.dep.usal.es) (Level 2, human only), Lit-BM.2. The second layer is a pathways layer constructed from the human reactome data extracted from NDEx [33].3. The third layer is a molecular complex layer corresponding to the fusion of Hu.ap and Corum. [0073] Drug multiplex network: The multiplex drug network integrates several sources and interaction types and has been extracted from Baptista et al., Commun Phys.2022; 5(1):170. Data derived from Cheng et al., Nat Commun.2019; 10(1):1-11 and the pharmacological drugs interaction network available at snap.stanford.edu were utilized. In this network, drugs are named according to DrugBank conventions, encompassing both the multiplex network and its associated bipartite networks.1. The first layer corresponds to clinical drug interactions. It includes 14,822 clinically reported adverse drug-drug interactions among 667 drugs.2. The second is the experimental drug combinations layer. It contains 737 experimentally validated drug combinations involving 376 drugs.3. The third represents the predicted drug combinations and includes 2,080 network-predicted combinations for hypertensive drugs, covering 65 different drugs.4. The last layer of the drug multiplex network includes the pharmacologic drug-drug interactions and consists of 48,514 interactions determined by the pharmacological effects of one drug on another, involving 1,514 drugs. [0074] Disease network: The disease multiplex network (DIS) has been structured into two layers, each representing different aspects of disease relationships.1. Disease-Disease Network Based on Shared Symptoms: Originating from a bipartite disease-symptoms network from Zhoue et al, Nat Commun.2014;5(1):1-10, this layer forms connections based on the cosine distance between diseases, retaining all interactions where this distance is above 0.5, indicating significant symptom overlap.2. Comorbidity Network: This layer integrates epidemiological data from Jensen et al., Nat Commun 2014;5(1)1-10 to illustrate the comorbidity relationships among diseases, highlighting epidemiological correlations. [0075] Each layer provides a unique perspective on disease interactions, encompassing treatment similarities, symptom relationships, and epidemiological data. [0076] The bipartite networks are: [0077] Gene-Disease Network: We extracted the curated gene-disease bipartite network from the DisGeNET database in order to connect the two molecular and disease multiplex networks. [0078] Drug-target Network: This network combines multiple sources including DrugBank Release Version 5.1.8, DrugCentral release v10.12, and associations described in other publications. [0079] Disease-drug Network: The associations between diseases and drugs are obtained from the Comparative Toxicogenomics Database. [0080] Evaluation of the Approach [0081] Link prediction [0082] We evaluated the quality of the embeddings using link prediction and tested the predictions of our model experimentally. We employed link prediction to assess the efficacy of our embeddings and validate our universal multilayer network embedding method for network biology and medicine. Our link prediction methodology entailed initially removing 30% of bipartite edges randomly in each bipartite network to form a training multilayer network. Subsequently, we employed a Random Forest classifier to this training network, as described in [8], and performed evaluations on a withheld subset consisting of 30% of the edges. We repeated this evaluation protocol 10 times. The binary classifier’s training method included the utilization of various operators on the node embeddings. These operators comprised Hadamard, Weighted- L1, Weighted-L2, Average, and cosine. [0083] The objective of this validation method was to ascertain the quality of the embeddings in the discovery of novel drug-gene-disease associations. At present, conducting direct comparative analyses with alternative methodologies is not practicable due to the unique nature of the embedding process for multilayer networks with three distinct node types from 3 different multiplex networks, a feature not yet paralleled in the existing literature. [0084] Case study on cancer and neurotransmitters [0085] The second approach we used for validation was to test the method on a case study, here to assess the link between neurotransmitter and cancer. Serotonin has already been linked to cancer and we wanted to know if other neurotransmitters might be predicted by our model which could lead to the discovery of new targets and drug repositioning for cancer. [0086] In order to test our system on these new results, we focused on biological modules. Once MultiXVERSE had been applied to the drug-disease-gene multilayer network, we used a clustering method on the embeddings and analyzed the clusters. The clustering method we applied was spherical k-means with ^^=500 applied on the embedding. [0087] We then applied link prediction to GABA agonist drugs using a Random Forest classifier with the operator ’Average’. [0088] Experimental testing: Materials and Methods [0089] Animal Husbandry [0090] Animal care was done in compliance with, and approval from, the Institutional Animal Care and Use Committee (IACUC) under protocol number M2023-18 of Tufts University. Xenopus embryos were collected according to standard protocols in 0.1X MMR ((Marc’s Modified Ringers) pH 7.8 + 0.1% Gentamicin. Xenopus embryos were staged according to. All experiments were approved by the University Institutional Animal Care and Use Committee (IACUC) under the protocol number M2023-18. We have complied with all relevant ethical regulations for animal use. Xenopus Laevis embryos were fertilized in vitro according to standard protocols, from eggs obtained from the adult frogs living in our Xenopus facility. [0091] Drug Exposure [0092] Stocks of muscimol (Tocris 0289) were kept at 10 mM concentration in DMSO. Embryos were exposed in 0.1X MMR during stages 12-43 in muscimol at a final concentration of 50 ^^M. [0093] Histology [0094] Embryos at stage 43-45 were embedded in JB4 according to the manufacturer’s directions (Polysciences) and sectioned on a Leica microtome at 20^^. They were then photographed on a Nikon SMZ-1500 microscope. [0095] Results [0096] Computational Results [0097] Evaluation Results Using Link Prediction [0098] The ROC-AUC is superior to 0.9 with Average operators for all bipartite networks (see Table 1), meaning that the method can predict with high precision the removed 30% of gene-disease, drug-disease and drug-target links from the corresponding multiplex- heterogeneous networks. ROC-AUC Operators Gene-disease Drug-target Drug-disease Average Hadamard 0.88 ± 0.002 0.92 ± 0.001 0.88 ± 0.003 0.90 ± 0.005 Weighted_L1 0.88 ± 0.002 0.62 ± 0.01 0.77 ± 0.002 0.76 ± 0.005 Weighted_L2 0.88 ± 0.003 0.63 ± 0.01 0.76 ± 0.001 0.76 ± 0.004 Average 0.94 ± 0.002 0.93 ± 0.003 0.91 ± 0.002 0.93 ± 0.002 Cosine 0.55 ± 0.005 0.83 ± 0.004 0.70 ± 0.002 0.70 ± 0.004 [0099] Table 1: ROC-AUC scores for link prediction using MultiXVerse. Link predictions are computer for the bipartite interactions of multiplex-heterogenous networks. We applied our evaluation protocol 10 times and found ROC-AUC superior to 0.9 with Average operators for all bipartite networks. The scores higher than 0.9 are highlighted in bold. [0100] The variance across all operators is minimal, indicating that the network embedding method demonstrates high robustness and consistency in each iteration of the link prediction evaluation test. [0101] Consistency of MultiXVERSE with MultiVERSE Results on the Progeria Cluster [0102] To assess the quality of the clustering of our embeddings, we analyzed the progeria cluster (see FIG.3). Hutchinson-Gilford Progeria Syndrome (HGPS) is a rare genetic disorder that causes premature aging. It is characterized by symptoms such as slowed postnatal growth, facial structural abnormalities, premature cardiovascular diseases, lipodystrophy, hair loss, and widespread osteodysplasia. HGPS arises from mutations in the LMNA genes, leading to the production of a deleterious version of the Lamin A protein, known as Progerin. [0103] The results are similar to those obtained with MultiVERSE: LMNA and HGPS were both found to be associated. We found several genes in both clusters including ZMPSTE24 in the cluster that is associated to accelerated aging in the literature and LMNA, but also LEMD2, RGS18, MARVELD1, KCNK13, IZUMO2, PERM1, LINC01857, and KIF12n. The clusters share diseases including muscular dystrophy, the Werner syndrome (the adult premature aging syndrome), deformities of the han and foot, and cardiac disease associated with progeria. [0104] KCNK13 is an especially interesting gene and encodes a K+ potassium ion channel (Potassium Two Pore Domain Channel Subfamily K Member 13). It is related to the Birk-Barel syndrome (BIBARS) - a rare genetic disorder characterized by motor and speech delay, impaired intellectual development, early feeding difficulties, muscular hypotonia, hyperactivity, aggression, and facial dysmorphism. This syndrome shares part of its phenotype with HGPS. HGPS has also been related to bioelectricity which can fall under the context of aging (or premature aging) as a channelopathy. [0105] Therefore, we conclude that we have similar results to the previous version of MultiVERSE, even if we have more multiplex and bipartite networks (MultiVERSE has been applied on a gene-disease mutilayer network) and a different set of networks, showing a good robustness to the integration of new data. [0106] The Clustering of Embeddings Shows Serotonin and GABA Pathways Linked to Cancer and Developmental Disorder [0107] In order to learn more about the link between cancer, developmental disorders and neurotransmitters, we analyzed the different clusters integrating those three components. We found that several clusters show a link between neurotransmitters including GABA and serotonin with cancer of malformations. One cluster (see FIG.4A) includes EPO and Darbepoetin alpha. Recombinant human erythropoietin is commonly used in clinical settings to treat anemia associated with cancer and chemotherapy. However, recent clinical trials indicate that rhEPO might also negatively affect disease progression and patient survival. Interestingly, EPO is known to increase GABA currents suggesting an implication of GABA neurotransmitter in the adverse effect of EPO in cancer development. [0108] A second cluster is linking developmental disorders, neurotransmitter drugs (gabapentin with lamotrigine) with serotonin syndrome and large cell carcinomas (see Fig.4B). Gabapentin is a structural analogue of the inhibitory neurotransmitter gamma-aminobutyric acid (GABA). Lamotrigine is an anti-glutamate agent and may enhance GABAergic transmission. Lamotrigine can also augment serotonin re-uptake inhibitors. This cluster suggests a link between GABA, serotonin and cancer, and that has been recently studied. [0109] Lastly, we found a cluster (see FIG.4C) including developmental disorders and Trimetazidine, which is an anti-ischemic drug that can inhibit platelet aggregation and regulate the expression of serotonin in a rodent model. [0110] These results indicate GABA pathways as potentially implicated in cancer development in addition to serotonin pathways. Our model predicted links between cancer and serotonin and GABA via clustering, and the serotonin link has been validated by published data. We decided to test the link between GABA and cancer using link prediction. We found in the literature that there was a recent known link between GABA and cancer, drawn from analysis of clinical samples and functional data in vitro. [0111] GABA Drugs Show Different Types of Cancer in the First 10 Predictions [0112] The link between GABA and cancer has been studied before and it has been found that GABA has a driver role in controlling stem and proliferative cell state through GHB production in glioma. Membrane potential and GABA(A) receptor expression differences have been found between hepatic tumor versus non-tumor stem cells. However, contradictory evidence has been reported showing that GABA could have an inhibitory effect on tumor progression or cell proliferation. Given this contradictory evidence, we decided to validate further this link in our system. [0113] We applied link prediction using the embeddings on Baclofen, Zaleplon, Clobazam, Progabide, Zolpidem and Gabapentin. These drugs are GABA agonists. [0114] All of the tested drugs showed a link with cancer in the 10 first predictions, with the exception of Zolpidem which showed a link with rectum neoplasm at the 18th prediction. Baclofen had a prediction for adrenal cortical carcinoma, Zaplelon for cancer of the esophagus and adrenal cortical carcinoma, Clobazam for soft tissue sarcoma and experimented neoplasm, Progabide for soft tissue neoplasm, and Gabapentin for soft tissue neoplasm. [0115] The new prediction of GABA agonist as a potential trigger of cancer had not yet been validated; thus, we sought to test it experimentally. [0116] Experimental Results [0117] In order to test the prediction that GABA pathway modulation should functionally induce a cancer-like phenotype in vivo, we used larval Xenopus laevis — a powerful model system commonly used to understand cancer-like dysregulation of cell function. One type of cancer-related phenotype that is especially readily investigated in frog embryos is the conversion of melanocytes, from normal pigment cells to hyper-proliferative, invasive melanoma-like behavior and induction of cancer markers. While normal melanocytes are also migratory during development, activating them via bioelectric dysregulation of chloride channel function induces a completely different phenotype. [0118] In order to perturb GABA signaling, we used muscimol, a well-known GABA agonist. Muscimol itself is not in the original data we used in our model but it is a GABA(A) agonist like Progabide [79, 80] that was found in the link prediction (see above), enabling us to test the utility of the model’s categorical predictions for novel drugs that it did not have direct experience with. Fifty Xenopus embryos, in triplicate, were exposed to 50 ^^M muscimol between stage 12 to stage 45 (after completion of gastrulation through swimming tadpole stages), and then sectioned. The results are shown in FIG.5; this was an extremely consistent and highly penetrant phenotype observed in 100% of the treated animals macroscopically, and in 10 sectioned (randomly-selected) animals. In contrast to controls, all of the exposed animals exhibited a drastic hyperpigmentation due to the melanocytes’ changing shape (from their normal round form to a much more elongated morphology), and migrating into inappropriate regions that are normally clear. This phenotype has previously been characterized quantitatively with respect to melanocyte shape and number, as well as the expression of cancer-related markers and melanoma-like migration into gut, brain, and vascular tissues. These results confirm the prediction of the model and link GABA-modulating drugs to a cancer-like phenotype in the absence of classic carcinogens, oncogenes, or DNA damage as the initiator of cellular conversion. [0119] Discussion [0120] In this work, we presented what is, to the best of our knowledge, the first universal multiplayer network embedding method with no limitations on number of multiplex and bipartite networks, thanks to recent developments in RWR. We applied MultiXVERSE to a multilayer network containing gene, drug, and diseases interactions and evaluated the quality of the embeddings using both clustering and link prediction. Our model predicted links between cancer, serotonin, and GABA via clustering. The serotonin link has been validated by published data; likewise, the GABA link has been studied before, suggesting that GABA has a driver role in controlling stem and proliferative cell state through GHB production in glioma. Also reported have been membrane potential differences and GABA(A) receptor expression in hepatic tumor versus non-tumor stem cells. However, contradictory evidence has been reported showing that GABA could have an inhibitory effect on tumor progression or cell proliferation. We tested the prediction from clustering between GABA and cancer and confirmed in vivo a causal link between GABA and a cancer-like phenotype in a vertebrate model system in vivo, in the absence of classic carcinogens, oncogenes, or DNA damage as the initiator of cellular conversion. These kinds of data have important implications for understanding of cancer etiology and possible normalization efforts using neurotransmitter modulators and need to be tested in preclinical mammalian models next. More research is necessary to understand the impact of GABA on cancer. [0121] Our main domain of application here was drug discovery for cancer, but the embeddings of gene-drug-disease multilayer networks may be used for many other applications. Different questions in network biology could be addressed by finding new genes related to specific diseases, such as the interesting target - the ion channel KCNK13 - revealed by our case study on the progeria cluster. This may be especially relevant due to the recent hypotheses about the role of bioelectric signaling in aging. We also applied link prediction for different drug- disease associations. Our method for drug repurposing has currently one significant limitation: in the case of new drug-target predictions, we don’t know if the drug will activate or inactivate the target. This may be resolved by using directed networks (unfortunately still rare) in the multiplex networks and training link prediction models on the embedding integrating the directional information of the links. It could be interesting too to add neighborhood-level structural representation for predicting new links in the networks, similarly to, or add attention mechanisms for drug repositioning. [0122] The model’s interpretability could be enhanced to better elucidate the decision- making process and underlying biological mechanisms. This is a theoretical problem that includes a large part of the field of machine learning - (neural) network interpretability. We focus here more on the biological usability of the method and empirical results, which will be a good addition to the field of network interpretability in biology and AI as a whole. However, we have to emphasize that the interpretability of our method probably does not have the same degree of complexity of deep learning, which gives cause for optimism for interpretability in the future. [0123] To improve the capabilities of Large Language Models (LLMs) in processing text-enriched images, researchers have developed embeddings specifically designed to capture image contexts. These embeddings are integrated as soft prompt inputs in LLMs, enhancing the models’ ability to effectively handle visual information. Similarly, we could use the node embeddings as input to use the power of LLMs for generative drug discovery including the richness of multiplex-heterogeneous network biological data. One relevant effort that can be integrated into this framework in the future is the bioinformatics of shape, which seeks to formalize and make amenable to machine learning data on large-scale anatomical outcomes in embryogenesis, regeneration, cancer, and bioengineering. We expect that future systems that combine pattern inference with diverse multi-modal datasets, comprising physiological, anatomical, and molecular-biological data will be a critical aid to human scientists and clinicians seeking to develop interventions for a wide range of biomedical applications. [0124] Conclusion [0125] The development of MultiXVERSE represents a significant advancement in the integration and analysis of multilayer networks for biological research. By providing a universal, scalable framework for multilayer network embedding, MultiXVERSE enables the systematic exploration of molecular and phenotypic interactions across diverse biological contexts. Our experimental validation of the predicted link between GABA and cancer using Xenopus laevis underscores its capability to generate biologically meaningful hypotheses and accelerate breakthroughs in multi-omics research. [0126] Future directions include applying MultiXVERSE to additional multi-omics datasets and integrating it with high-throughput experimental pipelines for systematic hypothesis generation and validation, particularly in drug discovery. Beyond its biological applications, MultiXVERSE is a versatile tool that can be utilized for analyzing multilayer networks in a wide range of fields, including social sciences and other complex systems. By offering a universal framework, MultiXVERSE paves the way for novel insights and interdisciplinary collaborations in multilayer network research. [0127] References for Example 1 [0128] 1.Kivelä M, Arenas A, Barthelemy M, Gleeson JP, Moreno Y, Porter MA. Multilayer networks. J Complex Netw.2014;2(3):203–71. [0129] 2.Zitnik M, Li MM, Wells A, Glass K, Gysi DM, Krishnan A, Murali T, Radivojac P, Roy S, Baudot A, et al. Current and future directions in network biology, arXiv preprint arXiv:2309.08478, 2023. [0130] 3.Liao L, He X, Zhang H, Chua T-S. Attributed social network embedding. IEEE Trans Knowl Data Eng.2018;30(12):2257–70. [0131] 4.Ma G, Lu C-T, He L, Philip SY, Ragin AB. Multi-view graph embedding with hub detection for brain network analysis. In 2017 IEEE International Conference on Data Mining (ICDM), pp.967–972, IEEE, 2017. [0132] 5.Csermely P, Kunsic N, Mendik P, Kerestély M, Faragó T, Veres DV, Tompa P. Learning of signaling networks: molecular mechanisms. Trends Biochem Sci.2020;45(4):284– 94. [0133] 6.Kovács IA, Mizsei R, Csermely P. A unified data representation theory for network visualization, ordering and coarse-graining. Sci Rep.2015;5(1):13786. [0134] 7.Nelson W, Zitnik M, Wang B, Leskovec J, Goldenberg A, Sharan R. To embed or not: network embedding as a paradigm in computational biology, Frontiers in genetics, 10, 2019. [0135] 8.Pio-Lopez L, Valdeolivas A, Tichit L, Remy É, Baudot A. Multiverse: a multiplex and multiplex-heterogeneous network embedding approach. Sci Rep.2021;11(1):1–20. [0136] 9.Valdeolivas A, Tichit L, Navarro C, Perrin S, Odelin G, Levy N, Cau P, Remy E, Baudot A. Random walk with restart on multiplex and heterogeneous biological networks. Bioinformatics.2018;35(3):497–505. [0137] 10.Tsitsulin A, Mottin D, Karras P, Müller E. Verse: Versatile graph embeddings from similarity measures. In Proceedings of the 2018 World Wide Web Conference, pp.539–548, International World Wide Web Conferences Steering Committee, 2018. [0138] 11.Baptista A, Gonzalez A, Baudot A. Universal multilayer network exploration by random walk with restart. Commun Phys.2022;5(1):170. [0139] 12.Bang D, Lim S, Lee S, Kim S. Biomedical knowledge graph learning for drug repurposing by extending guilt-by-association to multiple layers. Nat Commun. 2023;14(1):3570. [0140] 13.Middleton L, Melas I, Vasavda C, Raies A, Rozemberczki B, Dhindsa RS, Dhindsa JS, Weido B, Wang Q, Harper AR, et al. Phenome-wide identification of therapeutic genetic targets, leveraging knowledge graphs, graph neural networks, and uk biobank data, Science Advances, 10(19), p. eadj1424, 2024. [0141] 14.Huang K, Chandak P, Wang Q, Havaldar S, Vaid A, Leskovec J, Nadkarni GN, Glicksberg B. S, Gehlenborg N, Zitnik M. A foundation model for clinician-centered drug repurposing, Nature Medicine, pp.1–13, 2024. [0142] 15.Li MM, Huang Y, Sumathipala M, Liang MQ, Valdeolivas A, Ananthakrishnan AN, Liao K, Marbach D, Zitnik M. Contextual ai models for single-cell protein biology. Nat Methods.2024;21(8):1546–57. [0143] 16.Hu Y, Oleshko S, Firmani S, Zhu Z, Cheng H, Ulmer M, Arnold M, Colomé- Tatché M, Tang J, Xhonneux S, et al. Path-based reasoning for biomedical knowledge graphs with biopathnet, bioRxiv, 2024. [0144] 17.Jiménez A, Merino MJ, Parras J, Zazo S. Explainable drug repurposing via path based knowledge graph completion. Sci Rep.2024;14(1):16587. [0145] 18.Zhang H, Qiu L, Yi L, Song Y. Scalable multiplex network embedding. IJCAI. 2018;18:3082–8. [0146] 19.Bagavathi A, Krishnan S, Multi-net: A scalable multiplex network embedding framework. In International Conference on Complex Networks and their Applications, pp.119– 131, Springer, 2018. [0147] 20.Jiang S-H, Hu L-P, Wang X, Li J, Zhang Z-G. Neurotransmitters: emerging targets in cancer. Oncogene.2020;39(3):503–15. [0148] 21.Mancusi R, Monje M. The neuroscience of cancer. Nature. 2023;618(7965):467–79. [0149] 22.Qu M, Tang J, Shang J, Ren X, Zhang M, Han J. An attention-based collaboration framework for multi-view network representation learning. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp.1767–1776, 2017. [0150] 23.Shi Y, Han F, He X, He X, Yang C, Luo J, Han J. mvn2vec: preservation and collaboration in multi-view network embedding,” arXiv preprint arXiv:1801.06597, 2018. [0151] 24.Liao J, Yan J, Tao Q. Dualhgnn: A dual hypergraph neural network for semi- supervised node classification based on multi-view learning and density awareness. In 2023 International Joint Conference on Neural Networks (IJCNN), pp.1–10, IEEE, 2023. [0152] 25.Xue H, Yang L, Rajan V, Jiang W, Wei Y, Lin Y. Multiplex bipartite network embedding using dual hypergraph convolutional networks. Proc Web Conf.2021;2021:1649–60. [0153] 26.Liu Z, Huang C, Yu Y, Fan B, Dong J. Fast attributed multiplex heterogeneous network embedding. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp.995–1004, 2020. [0154] 27.Cen Y, Zou X, Zhang J, Yang H, Zhou J, Tang J. Representation learning for attributed multiplex heterogeneous network. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pp.1358–1368, 2019. [0155] 28.Yun S, Jeong M, Kim R, Kang J, Kim HJ. Graph transformer networks. Adv Neural Inf Proc Syst, 32, 2019. [0156] 29.Fu C, Yu P, Yu Y, Huang C, Zhao Z, Dong J. Mhgcn+: multiplex heterogeneous graph convolutional network. ACM Trans Intell Syst Technol.2024;15(3):1–25. [0157] 30.Gutmann M, Hyvärinen A. Noise-contrastive estimation: A new estimation principle for unnormalized statistical models. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp.297–304, 2010. [0158] 31.Mnih A, Kavukcuoglu K. Learning word embeddings efficiently with noise- contrastive estimation. Advances in neural information processing systems, vol.26, 2013. [0159] 32.Croft D, Mundo AF, Haw R, Milacic M, Weiser J, Wu G, Caudy M, Garapati P, Gillespie M, Kamdar MR, et al. The reactome pathway knowledgebase. Nucleic Acids Res. 2014;42(D1):D472–7. [0160] 33.Pratt D, Chen J, Welker D, Rivas R, Pillich R, Rynkov V, Ono K, Miello C, Hicks L, Szalma S, et al. Ndex, the network data exchange. Cell Syst.2015;1(4):302–5. [0161] 34.Drew K, Lee C, Huizar RL, Tu F, Borgeson B, McWhite CD, Ma Y, Wallingford JB, Marcotte E.M. Integration of over 9,000 mass spectrometry experiments builds a global map of human protein complexes. Molecular Syst Biol, 13(6), 2017. [0162] 35.Giurgiu M, Reinhard J, Brauner B, Dunger-Kaltenbach I, Fobo G, Frishman G, Montrone C, Ruepp A. Corum: the comprehensive resource of mammalian protein complexes- 2019. Nucleic Acids Res.2019;47(D1):D559–63. [0163] 36.Cheng F, Kovács IA, Barabási A-L. Network-based prediction of drug combinations. Nat Commun.2019;10(1):1–11. [0164] 37.Zhou X, Menche J, Barabási A-L, Sharma A. Human symptoms-disease network. Nat Commun.2014;5(1):1–10. [0165] 38.Jensen AB, Moseley PL, Oprea TI, Ellesøe SG, Eriksson R, Schmock H, Jensen PB, Jensen LJ, Brunak S. Temporal disease trajectories condensed from population-wide registry data covering 6.2 million patients. Nat Commun.2014;5(1):1–10. [0166] 39.Pio-Lopez L. Drug Repositioning Using Multiplex-Heterogeneous Network Embedding: A Case Study on SARS-CoV2. In Complex Networks & Their Applications X (R. M. Benito, C. Cherifi, H. Cherifi, E. Moro, L. M. Rocha, and M. Sales-Pardo, eds.), (Cham), pp. 731–741, Springer International Publishing, 2022. [0167] 40.Blackiston D, Adams DS, Lemire JM, Lobikin M, Levin M. Transmembrane potential of glycl-expressing instructor cells induces a neoplastic-like conversion of melanocytes via a serotonergic pathway. Disease Models Mech.2011;4(1):67–85. [0168] 41.Lobo D, Lobikin M, Levin M. Discovering novel phenotypes with automatically inferred dynamic models: a partial melanocyte conversion in xenopus. Sci Rep. 2017;7(1):41339. [0169] 42.Lobikin M, Lobo D, Blackiston DJ, Martyniuk CJ, Tkachenko E, Levin M. Serotonergic regulation of melanocyte conversion: A bioelectrically regulated network for stochastic all-or-none hyperpigmentation. Sci Signaling, 8(397), 2015. [0170] 43.Buchta C, Kober M, Feinerer I, Hornik K. Spherical k-means clustering. J Statist Softw.2012;50(10):1–22. [0171] 44.Sive HL, Grainger RM, Harland RM. Early development of Xenopus laevis: a laboratory manual. New York: Cold Spring Harbor Laboratory Press; 2000. [0172] 45.Nieuwkoop PD, Faber J. Normal table of Xenopus laevis (Daudin). A systematical and chronological survey of the development from the fertilized egg till the end of metamorphosis. Amsterdam: North-Holland Publishing Company, 1967. [0173] 46.Varela I, Cadinanos J, Pendás AM, Gutiérrez-Fernández A, Folgueras AR, Sánchez LM, Zhou Z, Rodríguez FJ, Stewart CL, Vega JA, et al. Accelerated ageing in mice deficient in zmpste24 protease is linked to p53 signalling activation. Nature. 2005;437(7058):564–8. [0174] 47.Worman HJ, Michaelis S. Prelamin a and zmpste24 in premature and physiological aging. Nucleus.2023;14(1):2270345. [0175] 48.van Tintelen JP, Hofstra RM, Katerberg H, Rossenbacker T, Wiesfeld AC, du Marchie Sarvaas GJ, Wilde AA, van Langen IM, Nannenberg EA, van der Kooi AJ, et al. High yield of lmna mutations in patients with dilated cardiomyopathy and/or conduction disease referred to cardiogenetics outpatient clinics. Am Heart J.2007;154(6):1130–9. [0176] 49.Lo C-Y, Tjong Y-W, Ho JC-Y, Siu C-W, Cheung S-Y, Tang NL, Yu S, Tse H-F, Yao X. An upregulation in the expression of vanilloid transient potential channels 2 enhances hypotonicity-induced cytosolic ca2+ rise in human induced pluripotent stem cell model of hutchinson gillford progeria. PLoS One.2014;9(1): e87273. [0177] 50.Pio-Lopez L, Levin M. Aging as a loss of morphostatic information: a developmental bioelectricity perspective. Ageing Res Rev, p.102310, 2024. [0178] 51.Szenajch J, Wcislo G, Jeong J-Y, Szczylik C, Feldman L. The role of erythropoietin and its receptor in growth, survival and therapeutic response of human tumor cells: from clinic to bench-a critical review. Biochimica et Biophysica Acta (BBA)-Reviews on Cancer, 1806(1), 82–95, 2010. [0179] 52.Roseti C, Cifelli P, Ruffolo G, Barbieri E, Guescini M, Esposito V, Di Gennaro G, Limatola C, Giovannelli A, Aronica E, et al. Erythropoietin increases gabaa currents in human cortex from tle patients. Neuroscience.2020;439:153–62. [0180] 53.Rose M, Kam P. Gabapentin: pharmacology and its use in pain management. Anaesthesia.2002;57(5):451–62. [0181] 54.Costa B, Vale N. Understanding lamotrigine’s role in the cns and possible future evolution. Int J Mol Sci.2023;24(7):6050. [0182] 55.Reid JG, Gitlin MJ, Altshuler LL. Lamotrigine in psychiatric disorders. J Clin Psychiatry.2013;74(7):675–84. [0183] 56.Liu M, Wei W, Stone CR, Zhang L, Tian G, Ding JN. Beneficial effects of trimetazidine on expression of serotonin and serotonin transporter in rats with myocardial infarction and depression. Neuropsychiatric Disease Treatment, pp.787–797, 2018. [0184] 57.Huang W, Cao L. Targeting gaba signalling for cancer treatment. Nat Cell Biology.2022;24(2):131–2. [0185] 58.Jayachandran P, Battaglin F, Strelez C, Lenz A, Algaze S, Soni S, Lo JH, Yang Y, Millstein J, Zhang W, et al. Breast cancer and neurotransmitters: emerging insights on mechanisms and therapeutic directions. Oncogene.2023;42(9):627–37. [0186] 59.Huang D, Wang Y, Thompson JW, Yin T, Alexander PB, Qin D, Mudgal P, Wu H, Liang Y, Tan L, et al. Cancer-cell-derived gaba promotes -catenin-mediated tumour growth and immunosuppression. Nat Cell Biol.2022;24(2):230–41. [0187] 60.Azuma H, Inamoto T, Sakamoto T, Kiyama S, Ubai T, Shinohara Y, Maemura K, Tsuji M, Segawa N, Masuda H, et al. -aminobutyric acid as a promoting factor of cancer metastasis; induction of matrix metalloproteinase production is potentially its underlying mechanism. Cancer Res.2003;63(23):8090–6. [0188] 61.El-Habr EA, Dubois LG, Burel-Vandenbos F, Bogeas A, Lipecka J, Turchi L, Lejeune F-X, Coehlo PLC, Yamaki T, Wittmann BM, et al. A driver role for gaba metabolism in controlling stem and proliferative cell state through ghb production in glioma. Acta Neuropathologica.2017;133:645–60. [0189] 62.Bautista W, Perez-Alvarez V, Burczynski F, Raouf A, Klonisch T, Minuk G. Membrane potential differences and gabaa receptor expression in hepatic tumor and non-tumor stem cells. Canadian J Physiol Pharmacol.2014;92(1):85–91. [0190] 63.Schuller HM, Al-Wadei HA, Majidi M. Gamma-aminobutyric acid, a potential tumor suppressor for small airway-derived lung adenocarcinoma. Carcinogenesis. 2008;29(10):1979–85. [0191] 64.Ortega A. A new role for gaba: inhibition of tumor cell migration. Trends Pharmacol Sci.2003;24(4):151–4. [0192] 65.Finnimore A, Roebuck M, Sajkov D, McEvoy R. The effects of the gaba agonist, baclofen, on sleep and breathing. Eur Respir J.1995;8(2):230–4. [0193] 66.Noguchi H, Kitazumi K, Mori M, Shiba T. Binding and neuropharmacological profile of zaleplon, a novel nonbenzodiazepine sedative/hypnotic. Eur J Pharmacol.2002;434(1– 2):21–8. [0194] 67.Huddart R, Leeder JS, Altman RB, Klein TE. Pharmgkb summary: clobazam pathway, pharmacokinetics. Pharmacogenetics Genomics.2018;28(4):110–5. [0195] 68.Bergmann KJ. Progabide: a new gaba-mimetic agent in clinical use. Clin Neuropharmacology.1985;8(1):13–26. [0196] 69.Sanna E, Busonero F, Talani G, Carta M, Massa F, Peis M, Maciocco E, Biggio G. Comparison of the effects of zaleplon, zolpidem, and triazolam at various gabaa receptor subtypes. Eur J Pharmacol.2002;451(2):103–10. [0197] 70.Hardwick LJ, Philpott A. An oncologist’s friend: How xenopus contributes to cancer research. Dev Biol.2015;408(2):180–7. [0198] 71.Hardwick LJ, Philpott A. Xenopus models of cancer: expanding the oncologist’s toolbox. Front Physiol.2018;9: 424568. [0199] 72.Lobikin M, Chernet B, Lobo D, Levin M. Resting potential, oncogene-induced tumorigenesis, and metastasis: the bioelectric basis of cancer in vivo. Phys Biol.2012;9(6): 065002. [0200] 73.Morokuma J, Blackiston D, Adams DS, Seebohm G, Trimmer B, Levin M. Modulation of potassium channel function confers a hyperproliferative invasive phenotype on embryonic stem cells. Proc Nat Acad Sci.2008;105(43):16608–13. [0201] 74.Benkherouf AY, Taina K-R, Meera P, Aalto AJ, Li X-G, Soini SL, Wallner M, Uusi-Oukari M. Extrasynaptic -gabaa receptors are high-affinity muscimol receptors. J Neurochem.2019;149(1):41–53. [0202] 75.Johnston GA. Muscimol as an ionotropic gaba receptor agonist. Neurochem Res.2014;39:1942–7. [0203] 76.Leach MJ, Wilson JA. Gaba receptor binding with 3h-muscimol in calf cerebellum. Eur J Pharmacol.1978;48(3):329–30. [0204] 77.Scheel-Krüger J, Cools AR, Van Wel PM. Muscimol a gaba-agonist injected into the nucleus accumbens increases apomorphine stereotypy and decreases the motility. Life Sci.1977;21(11):1697–702. [ [0205] 78.Shoulson I, Goldblatt D, Charlton M, Joynt RJ. Huntington’s disease: treatment with muscimol, a gaba-mimetic drug. Ann Neurol: Official J Am Neurol Assoc Child Neurol Soc.1978;4(3):279–84. [0206] 79.Bartholini G. Pharmacology of the gabaergic system: effects of progabide, a gaba receptor agonist. Psychoneuroendocrinology.1984;9(2):135–40. [0207] 80.Wahab A, Heinemann U, Albus K. Effects of -aminobutyric acid (gaba) agonists and a gaba uptake inhibitor on pharmacoresistant seizure like events in organotypic hippocampal slice cultures. Epilepsy Res.2009;86(2–3):113–23. [0208] 81.Maffini MV, Calabro JM, Soto AM, Sonnenschein C. Stromal regulation of neoplastic development: age-dependent normalization of neoplastic mammary cells by mammary stroma. Am J Pathol.2005;167(5):1405–10. [0209] 82.Kasemeier-Kulesa JC, Teddy JM, Postovit L-M, Seftor EA, Seftor RE, Hendrix MJ, Kulesa PM. Reprogramming multipotent tumor cells with the embryonic neural crest microenvironment. Dev Dyn.2008;237(10):2657–66. [0210] 83.Mintz B, Illmensee K. Normal genetically mosaic mice produced from malignant teratocarcinoma cells. Proc Nat Acad Sci.1975;72(9):3585–9. [0211] 84.Anderson B. Bioelectricity: a top-down control model to promote more effective aging interventions. Bioelectricity.2024;6(1):2–12. [0212] 85.Silver BB, Nelson CM. The bioelectric code: reprogramming cancer and aging from the interface of mechanical and chemical microenvironments. Front Cell Dev Biol. 2018;6:21. [0213] 86.Zhao B-W, Su X-R, Yang Y, Li D-X, Li G-D, Hu P-W, Luo X, Hu L. A heterogeneous information network learning model with neighborhood-level structural representation for predicting lncrna-mirna interactions. Comput Struct Biotechnol J. 2024;23:2924–33. [0214] 87.Zhao B-W, Su X-R, Hu P-W, Ma Y-P, Zhou X, Hu L. A geometric deep learning framework for drug repositioning over heterogeneous information networks. Briefings Bioinf, 23,(6), p. bbac384, 2022. [0215] 88.Zhao B-W, Su X-R, Yang Y, Li D-X, Li G-D, Hu P-W, You Z-H, Luo X, Hu L. Regulation-aware graph learning for drug repositioning over heterogeneous biological network. Inf Sci.2025;686: 121360. [0216] 89.Prouteau T, Dugué N, Guillot S. From communities to interpretable network and word embedding: an unified approach. J Complex Netw, 12(6), p. cnae034, 2024. [0217] 90.Zhang Y, Tiňo P, Leonardis A, Tang K. A survey on neural network interpretability. IEEE Trans Emerg Topics Computat Intell.2021;5(5):726–42. [0218] 91.Fan F-L, Xiong J, Li M, Wang G. On interpretability of artificial neural networks: a survey. IEEE Trans Radiation Plasma Med Sci.2021;5(6):741–60. [0219] 92.Zhang X, Zhao Z, Li C, Zhang Y, Zhao J. An interpretable and scalable recommendation method based on network embedding. IEEE Access.2019;7:9384–94. [0220] 93.Hu W, Xu Y, Li Y, Li W, Chen Z, Tu Z. Bliva: A simple multimodal llm for better handling of text-rich visual questions. Proc AAAI Conf Artif Intell.2024;38:2256–64. [0221] 94.Lobo D, Malone TJ, Levin M. Towards a bioinformatics of patterning: a computational approach to understanding regulative morphogenesis. Biol Open.2012;2(2):156– 69. [0222] 95.Lobo D, Feldman EB, Shah M, Malone TJ, Levin M. A bioinformatics expert system linking functional data to anatomical outcomes in limb regeneration. Regeneration. 2014;1(2):37–56. [0223] Example 2 [0224] FIG.6 shows an example process 600 to characterize a multilayer network. At step 602, a network is obtained. The network may be a multilayer network made of a plurality of multiplex heterogeneous networks and a plurality of bipartite networks connecting the multiplex heterogeneous networks in the plurality of multiple heterogeneous networks. In some embodiments, each multiple heterogeneous network in the plurality of multiple heterogeneous networks includes a plurality of nodes. At step 604, a trained machine learning algorithm may be applied to relate the plurality of multiplex heterogeneous networks to an embedding space. The trained machine learning algorithm may use random walk with restart. At step 606, the embedding space of the plurality of multiplex heterogeneous networks can be used to extract information to corresponding to interactions between the plurality of nodes of each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks. As step 608, the embedding space and extracted information may be output. In some embodiments, the information may be output to a user. [0225] In FIG.7, an example 700 of a system (e.g., a data processing system) for characterizing a multilayer network is provided. [0226] In some embodiments, computing device 704 and/or server 716 can be any suitable computing device or combination of devices, such as a desktop computer, a laptop computer, a smartphone, a tablet computer, a wearable computer, a server computer, a virtual machine being executed by a physical computing device, etc. As described herein, system 700 can present information about the characterized multilayer network to a user (e.g., a researcher and/or a physician). [0227] In some embodiments, communication network 702 can be any suitable communication network or combination of communication networks. In some embodiments, communication network 702 can be any suitable communication network or combination of communication networks. For example, communication network 702 can include a Wi-Fi network (which can include one or more wireless routers, one or more switches, etc.), a peer-to- peer network (e.g., a Bluetooth network), a cellular network (e.g., a 4G network, a 5G network, etc., complying with any suitable standard, such as CDMA, GSM, LTE, LTE Advanced, WiMAX, etc.), a wired network, etc. In some embodiments, communication network 702 can be a local area network, a wide area network, a public network (e.g., the Internet), a private or semi- private network (e.g., a corporate or university intranet), any other suitable type of network, or any suitable combination of networks. Communications links shown in FIG.7 can each be any suitable communications link or combination of communications links, such as wired links, fiber optic links, Wi-Fi links, Bluetooth links, cellular links, etc. [0228] FIG.7 additionally shows an example of hardware that can be used to implement computing device 704 and server 716 in accordance with some embodiments of the disclosed subject matter. In some embodiments, computing device 704 can be used to execute one or more set of instructions to identify a behavioral catalog. In other embodiments, computing device 704 can be used to characterize a multilayer network. [0229] As shown in FIG.7, computing device 704 can include one or more hardware processor 706, one or more displays 708, one or more inputs 710, one or more communications 712, and/or memory 714. In some embodiments, processor 706 can be any suitable hardware processor or combination of processors, such as central processing unit, a graphics processing unit, etc. In some embodiments, display 708 can include any suitable display devices, such as a computer monitor, a touchscreen, a television, etc. In some embodiments, inputs 710 can include any suitable input device and/or sensors that can be used to receive user input, such as a keyboard, a mouse, a touchscreen, a microphone, etc. [0230] In some embodiments, communication systems 712 can include any suitable hardware, firmware, and/or software for communicating information over communication network 702 and/or any other suitable communication networks. For example, communications systems 712 can include one or more transceivers, one or more communication chips and/or chip sets, etc. In a more particular example, communications systems 712 can include hardware, firmware and/or software that can be used to establish a Wi-Fi connection, a Bluetooth connection, a cellular connection, an Ethernet connection, etc. [0231] In some embodiments, memory 714 can include any suitable storage device or devices that can be used to store instructions, values, etc., that can be used, for example, by processor 706 to present content using display 708, to communicate with server 716 via communications system(s) 712, etc. [0232] Memory 714 can include any suitable volatile memory, non-volatile memory, storage, or any suitable combination thereof. For example, memory 714 can include RAM, ROM, EEPROM, one or more flash drives, one or more hard disks, one or more solid state drives, one or more optical drives, etc. In some embodiments, memory 714 can have encoded thereon a computer program for controlling operation of computing device 704. In such embodiments, processor 706 can execute at least a portion of the computer program to present content (e.g., images, user interfaces, graphics, tables, etc.), receive content from server 716, transmit information to server 716, etc. [0233] In some embodiments, server 716 can include a processor 718, a display 720, one or more inputs 722, one or more communications systems 724, and/or memory 726. In some embodiments, processor 718 can be any suitable hardware processor or combination of processors, such as a central processing unit, a graphics processing unit, etc. In some embodiments, display 720 can include any suitable display devices, such as a computer monitor, a touchscreen, a television, etc. In some embodiments, inputs 722 can include any suitable input devices and/or sensors that can be used to receive user input, such as a keyboard, a mouse, a touchscreen, a microphone, etc. [0234] In some embodiments, communications systems 724 can include any suitable hardware, firmware, and/or software for communicating information over communication network 702 and/or any other suitable communication networks. For example, communications systems 724 can include one or more transceivers, one or more communication chips and/or chip sets, etc. In a more particular example, communications systems 724 can include hardware, firmware and/or software that can be used to establish a Wi-Fi connection, a Bluetooth connection, a cellular connection, an Ethernet connection, etc. [0235] In some embodiments, memory 726 can include any suitable storage device or devices that can be used to store instructions, values, etc., that can be used, for example, by processor 718 to present content using display 720, to communicate with one or more computing devices 704, etc. Memory 726 can include any suitable volatile memory, non-volatile memory, storage, or any suitable combination thereof. For example, memory 726 can include RAM, ROM, EEPROM, one or more flash drives, one or more hard disks, one or more solid state drives, one or more optical drives, etc. In some embodiments, memory 726 can have encoded thereon a server program for controlling operation of server 716. In such embodiments, processor 718 can execute at least a portion of the server program to transmit information and/or content (e.g., results of a tissue identification and/or classification, a user interface, etc.) to one or more computing devices 704, receive information and/or content from one or more computing devices 704, receive instructions from one or more devices (e.g., a personal computer, a laptop computer, a tablet computer, a smartphone, etc.), etc. [0236] In some embodiments, any suitable computer readable media can be used for storing instructions for performing the functions and/or processes described herein. For example, in some embodiments, computer readable media can be transitory or non-transitory. For example, non-transitory computer readable media can include media such as magnetic media (such as hard disks, floppy disks, etc.), optical media (such as compact discs, digital video discs, Blu-ray discs, etc.), semiconductor media (such as RAM, Flash memory, electrically programmable read only memory (EPROM), electrically erasable programmable read only memory (EEPROM), etc.), any suitable media that is not fleeting or devoid of any semblance of permanence during transmission, and/or any suitable tangible media. As another example, transitory computer readable media can include signals on networks, in wires, conductors, optical fibers, circuits, or any suitable media that is fleeting and devoid of any semblance of permanence during transmission, and/or any suitable intangible media. [0237] A number of references to patent and non-patent documents are made throughout the publication, each of which is herein incorporated by reference in its entirety. [0238] Thus, while the invention has been described above in connection with particular embodiments and examples, the invention is not necessarily so limited, and that numerous other embodiments, examples, uses, modifications and departures from the embodiments, examples and uses are intended to be encompassed by the claims attached hereto.

Claims

CLAIMS What is claimed is: 1. A method for characterizing a multilayer network, comprising: obtaining a multilayer network comprising a plurality of multiplex heterogenous networks and a plurality of bipartite networks connecting the multiplex heterogeneous networks in the plurality of multiplex heterogeneous networks, wherein each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks comprises a plurality of nodes; applying a trained machine learning algorithm to relate the plurality of multiplex heterogeneous networks to an embedding space, wherein the trained machine learning algorithm uses random walk with restart; using the embedding space of the plurality of multiplex heterogeneous networks to extract information corresponding to interactions between the plurality of nodes of each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks; and outputting the embedding space and the extracted information to a user.
2. The method of claim 1, wherein at least one multiplex heterogenous network in the plurality of multiplex heterogenous networks comprises weighted or directed nodes, and wherein at least one multiplex heterogenous network in the plurality of multiplex heterogenous networks comprises undirected nodes.
3. The method of claim 1 or 2, wherein the multilayer network further comprises at least one monoplex network.
4. The method of any one of the preceding claims, wherein the plurality of multiplex heterogenous networks comprises more than three multiplex heterogenous networks and the plurality of bipartite networks comprises more than three bipartite networks.
5. The method of any one of the preceding claims, wherein each multiplex heterogenous network in the plurality of multiplex heterogenous networks corresponds to a dataset.
6. The method of claim 5, wherein the corresponding dataset is at least one of a molecular dataset, a drug dataset, or a disease dataset.
7. The method of claim 6, wherein the method is used to identify at least one drug related to a disease.
8. The method of claim 6, wherein the method is used to identify a drug for repurposing.
9. The method of claim 6, wherein the method is used to characterize drug synergies.
10. The method of any one of the preceding claims, wherein the method is used to characterize a system related to at least one of birth defects, regeneration, aging, degeneration, cancer, neurological disease, in vitro bioengineering, synthetic food production, or agriculture.
11. The method of any one of the preceding claims, wherein the method further comprises interpreting the output embedding space and extracted information using at least one machine learning technique.
12. A system for characterizing a multilayer network, comprising: a processor in communication with a memory, the memory having stored thereon a set of instructions which, when executed by the processor, cause the processor to: obtain a multilayer network comprising a plurality of multiplex heterogenous networks and a plurality of bipartite networks connecting the multiplex heterogeneous networks in the plurality of multiplex heterogeneous networks, wherein each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks comprises a plurality of nodes; apply a trained machine learning algorithm to relate the plurality of multiplex heterogeneous networks to an embedding space, wherein the trained machine learning algorithm uses random walk with restart; use the embedding space of the plurality of multiplex heterogeneous networks to extract information corresponding to interactions between the plurality of nodes of each multiplex heterogeneous network in the plurality of multiplex heterogeneous networks; and output the embedding space and the extracted information to a user.
13. The system of claim 12, wherein at least one multiplex heterogenous network in the plurality of multiplex heterogenous networks comprises weighted or directed nodes, and wherein at least one multiplex heterogenous network in the plurality of multiplex heterogenous networks comprises undirected nodes.
14. The system of claim 12 or 13, wherein the multilayer network further comprises at least one monoplex network.
15. The system of any one of claims 12-14, wherein each multiplex heterogenous network in the plurality of multiplex heterogenous networks corresponds to a dataset.
16. The system of claim 15, wherein the corresponding dataset is at least one of a molecular dataset, a drug dataset, or a disease dataset.
17. The system of claim 16, wherein the method is used to identify at least one drug related to a disease.
18. The system of claim 16, wherein the method is used to identify a drug for repurposing.
19. The system of claim 17, wherein the method is used to characterize drug synergies.
20. The system of any one of claims 12-19, wherein the system is used to characterize a system related to at least one of birth defects, regeneration, aging, degeneration, cancer, neurological disease, in vitro bioengineering, synthetic food production, or agriculture.
21. The system of any one of claims 12-20, wherein the system further causes the processor to interpret the output embedding space and extracted information using at least one machine learning technique.
PCT/US2025/035658 2024-06-28 2025-06-27 Methods and systems for modeling multilayer networks Pending WO2026006705A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202463665915P 2024-06-28 2024-06-28
US63/665,915 2024-06-28

Publications (1)

Publication Number Publication Date
WO2026006705A1 true WO2026006705A1 (en) 2026-01-02

Family

ID=98222892

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2025/035658 Pending WO2026006705A1 (en) 2024-06-28 2025-06-27 Methods and systems for modeling multilayer networks

Country Status (1)

Country Link
WO (1) WO2026006705A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140317033A1 (en) * 2013-04-23 2014-10-23 International Business Machines Corporation Predictive and descriptive analysis on relations graphs with heterogeneous entities
US20150317376A1 (en) * 2014-05-01 2015-11-05 International Business Machines Corporation Method, system and computer program product for automating expertise management using social and enterprise data
US20220374812A1 (en) * 2021-05-24 2022-11-24 Skillsacpe Analytics LLC Systems and methods for generation and traversal of a skill representation graph using machine learning

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140317033A1 (en) * 2013-04-23 2014-10-23 International Business Machines Corporation Predictive and descriptive analysis on relations graphs with heterogeneous entities
US20150317376A1 (en) * 2014-05-01 2015-11-05 International Business Machines Corporation Method, system and computer program product for automating expertise management using social and enterprise data
US20220374812A1 (en) * 2021-05-24 2022-11-24 Skillsacpe Analytics LLC Systems and methods for generation and traversal of a skill representation graph using machine learning

Similar Documents

Publication Publication Date Title
Ma et al. Integrative methods and practical challenges for single-cell multi-omics
Wang et al. CoSpar identifies early cell fate biases from single-cell transcriptomic and lineage information
Tyler et al. PyMINEr finds gene and autocrine-paracrine networks from human islet scRNA-Seq
Hartl et al. Coexpression network architecture reveals the brain-wide and multiregional basis of disease susceptibility
Rozenblatt-Rosen et al. The Human Cell Atlas: from vision to reality
Bumbarger et al. System-wide rewiring underlies behavioral differences in predatory and bacterial-feeding nematodes
Lobo et al. Towards a bioinformatics of patterning: a computational approach to understanding regulative morphogenesis
Wang et al. SpatialAgent: An autonomous AI agent for spatial biology
Özsezen et al. Inference of the high-level interaction topology between the metabolic and cell-cycle oscillators from single-cell dynamics
Cheng et al. Full characterization of localization diversity in the human protein interactome
Hong et al. eMCI: An explainable multimodal correlation integration model for unveiling spatial transcriptomics and intercellular signaling
Fdez-Riverola et al. Practical Applications of Computational Biology and Bioinformatics, 12th International Conference
Wang et al. A single-cell omics network model of cell crosstalk during the formation of primordial follicles
Wu et al. Discovery and generalization of tissue structures from spatial omics data
Wang et al. Bioinformatics for plant and agricultural discoveries in the age of multiomics: A review and case study of maize nodal root growth under water deficit
Dasgupta et al. Single-cell RNA sequencing: a new window into cell scale dynamics
Duan et al. imiracle: An iterative multi-view graph neural network to model intercellular gene regulation from spatial transcriptomic data
WO2026006705A1 (en) Methods and systems for modeling multilayer networks
Pio-Lopez et al. Universal multilayer network embedding reveals a causal link between GABA neurotransmitter and cancer: L. Pio-Lopez, M. Levin
Wang et al. A variational graph-partitioning approach to modeling protein liquid-liquid phase separation
Levin et al. Technological approach to mind everywhere: a framework for conceptualizing goal-directedness in biology and other domains
Sikkander et al. Intelligent visualization frameworks driven by AI for multi-dimensional genomic data exploration and interpretation
Castoe et al. A vision for VenomsBase: an integrated knowledgebase for the study of venoms and their applications
Kumbier et al. Signed iterative random forests to identify enhancer-associated transcription factor binding
Mori et al. eSPRESSO: topological clustering of single-cell transcriptomics data to reveal informative genes for spatio–temporal architectures of cells

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 25827615

Country of ref document: EP

Kind code of ref document: A1