Paruchuri et al., 2024 - Google Patents
What are the odds? language models are capable of probabilistic reasoningParuchuri et al., 2024
View PDF- Document ID
- 14748756813514980443
- Author
- Paruchuri A
- Garrison J
- Liao S
- Hernandez J
- Sunshine J
- Althoff T
- Liu X
- McDuff D
- Publication year
- Publication venue
- arXiv preprint arXiv:2406.12830
External Links
Snippet
Language models (LM) are capable of remarkably complex linguistic tasks; however, numerical reasoning is an area in which they frequently struggle. An important but rarely evaluated form of reasoning is understanding probability distributions. In this paper, we …
- 238000009826 distribution 0 abstract description 257
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6279—Classification techniques relating to the number of classes
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Sharma et al. | The truth is in there: Improving reasoning in language models with layer-selective rank reduction | |
US20190370684A1 (en) | System for automatic, simultaneous feature selection and hyperparameter tuning for a machine learning model | |
Saxena et al. | On applying the prognostic performance metrics | |
Paruchuri et al. | What are the odds? language models are capable of probabilistic reasoning | |
Alquier et al. | Prediction of time series by statistical learning: general losses and fast rates | |
Pusponegoro et al. | Linear mixed model for analyzing longitudinal data: A simulation study of children growth differences | |
Liu et al. | Nonstationary bandit learning via predictive sampling | |
Lambert et al. | R∗: A robust MCMC convergence diagnostic with uncertainty using decision tree classifiers | |
Berry et al. | Monte Carlo comparisons of the asymptotic chi-square and likelihood-ratio tests with the nonasymptotic chi-square tests for sparse r× c tables. | |
Holt et al. | Essential Aspects of Bayesian Data Imputation | |
Kapetanios et al. | Modeling structural breaks in economic relationships using large shocks | |
Domański et al. | Outliers in Control Engineering: Fractional Calculus Perspective | |
Walsh et al. | Information theory: Some concepts and measures | |
Škvára et al. | Is AUC the best measure for practical comparison of anomaly detectors? | |
Wang | Employee Salaries Analysis and Prediction with Machine Learning | |
Chen et al. | Fitting your favorite mixed models with PROC MCMC | |
Ku et al. | Testing for stochastic independence: application to blind source separation | |
Wang et al. | Statistical Inference for Networks of High-Dimensional Point Processes | |
Shan et al. | Network resampling for estimating uncertainty | |
Leydesdorff | Relations among science indicators or more generally among anything one might wish to count about texts: II. The dynamics of science | |
Tsang et al. | Nowcasting directional change in high frequency FX markets | |
Zhou et al. | Detecting Errors in a Numerical Response via any Regression Model | |
Wang et al. | Process Duration Modeling and Concept Drift Detection using Phase-Type Distributions | |
Zamri et al. | A review on models for count data with extra zeros | |
Younge et al. | First movers and follow-on invention: evidence from a vector space model of invention |