Reads a dataset from Dataiku's Data Science Studio

Usage

read.dataset(name, partitions = NULL, sampling = NULL, columns = NULL)

Arguments

name
name of the dataset to read
partitions
character vector of partitions to load
sampling
a list generated by the sampling functions, e.g. fixed.sampling
columns
a character vector of columns to read from dataset

Value

A data.frame with the requested data

Description

Reads a dataset from Dataiku's Data Science Studio

Details

Users can specify which partitions and columns to load, as well as a sampling scheme if the dataset is too large to fit into memory.

This function is deprecated. Use dkuReadDataset instead.

Examples

## Not run: 
# d = read.dataset("iris")
#
# s = fixed.sampling(nbrow=1000)
# d = read.dataset("iris", sampling=s)
#
# d = read.dataset("iris", columns=c("Sepal.Length", "Sepal.Width"))
## End(Not run)