[go: up one dir, main page]

Paruchuri et al., 2024 - Google Patents

What are the odds? language models are capable of probabilistic reasoning

Paruchuri et al., 2024

View PDF
Document ID
14748756813514980443
Author
Paruchuri A
Garrison J
Liao S
Hernandez J
Sunshine J
Althoff T
Liu X
McDuff D
Publication year
Publication venue
arXiv preprint arXiv:2406.12830

External Links

Snippet

Language models (LM) are capable of remarkably complex linguistic tasks; however, numerical reasoning is an area in which they frequently struggle. An important but rarely evaluated form of reasoning is understanding probability distributions. In this paper, we …
Continue reading at arxiv.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/02Knowledge representation
    • G06N5/022Knowledge engineering, knowledge acquisition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06K9/6232Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
    • G06K9/6247Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computer systems based on specific mathematical models
    • G06N7/005Probabilistic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6267Classification techniques
    • G06K9/6279Classification techniques relating to the number of classes

Similar Documents

Publication Publication Date Title
Sharma et al. The truth is in there: Improving reasoning in language models with layer-selective rank reduction
US20190370684A1 (en) System for automatic, simultaneous feature selection and hyperparameter tuning for a machine learning model
Saxena et al. On applying the prognostic performance metrics
Paruchuri et al. What are the odds? language models are capable of probabilistic reasoning
Alquier et al. Prediction of time series by statistical learning: general losses and fast rates
Pusponegoro et al. Linear mixed model for analyzing longitudinal data: A simulation study of children growth differences
Liu et al. Nonstationary bandit learning via predictive sampling
Lambert et al. R∗: A robust MCMC convergence diagnostic with uncertainty using decision tree classifiers
Berry et al. Monte Carlo comparisons of the asymptotic chi-square and likelihood-ratio tests with the nonasymptotic chi-square tests for sparse r× c tables.
Holt et al. Essential Aspects of Bayesian Data Imputation
Kapetanios et al. Modeling structural breaks in economic relationships using large shocks
Domański et al. Outliers in Control Engineering: Fractional Calculus Perspective
Walsh et al. Information theory: Some concepts and measures
Škvára et al. Is AUC the best measure for practical comparison of anomaly detectors?
Wang Employee Salaries Analysis and Prediction with Machine Learning
Chen et al. Fitting your favorite mixed models with PROC MCMC
Ku et al. Testing for stochastic independence: application to blind source separation
Wang et al. Statistical Inference for Networks of High-Dimensional Point Processes
Shan et al. Network resampling for estimating uncertainty
Leydesdorff Relations among science indicators or more generally among anything one might wish to count about texts: II. The dynamics of science
Tsang et al. Nowcasting directional change in high frequency FX markets
Zhou et al. Detecting Errors in a Numerical Response via any Regression Model
Wang et al. Process Duration Modeling and Concept Drift Detection using Phase-Type Distributions
Zamri et al. A review on models for count data with extra zeros
Younge et al. First movers and follow-on invention: evidence from a vector space model of invention