Partition identifiers¶
When dealing with partitioned datasets, you need to identify or refer to partitions. A partition identifier uniquely identifies a single partition within a dataset.
The identifier of a partition is made by concatenating the dimension values, separated by | (pipe)
Time dimension identifiers¶
The format of a time dimension identifier depends on the time dimension granularity
Period | Format | Example |
---|---|---|
Year | YYYY | 2014 |
Month | YYYY-MM | 2014-01 |
Day | YYYY-MM-DD | 2014-01-17 |
Hour | YYYY-MM-DD-HH | 2014-01-17-13 |
Discrete dimension identifiers¶
The identifier of a discrete dimension value is the value itself. As such, discrete dimensions should only contain letters and numbers.
Multiple dimensions¶
For example, if you have a time dimension named “date” with a “DAY” granularity and a discrete dimension named “country”, then you could have partitions like:
- 2013-01-01|France
- 2013-01-01|Italy
- 2013-01-02|France
- 2013-01-02|Italy
Ranges specifications¶
In various locations in DSS, you can use a “partition range specification” syntax to refer to a set of partitions or values of a dimension.
For example:
- In “Exact values” dependency function
- When building datasets
- When creating scheduled jobs
- …
The generic syntax is:
PARTITION_SPEC = DIMENSION_SPEC|DIMENSION_SPEC|….
DIMENSION_SPEC =
- DATE # Single Date (Time)
- DATE/DATE # Date Range (Time)
- Any # Actual Value (ExactValue)
- Any/Any/Any/… # Several values (ExactValue)
Examples:
2014-01-25/2014-01-28 : Single DAY dimension, select 3 days
2014-01-25-14/2014-01-28-15: Single HOUR dimension, select 73 hours
2014-02/2014-03|FR/IT: One MONTH dimension and one discrete dimension, select a total of 4 partitions:
- 2014-02|FR
- 2014-02|IT
- 2014-03|FR
- 2014-03|IT