Get Dataset
Route
/v2/datasets/{id}
Description
Retrieve a specific dataset by its unique identifier. Returns detailed information about the dataset configuration and properties.
Method
GET
Inputs
| Parameter | Type | Required | Description |
|---|---|---|---|
id | string | Yes | The unique identifier of the dataset to retrieve |
Output
| Field | Type | Description |
|---|---|---|
type | string | Always "datasets" |
size | integer | Number of datasets returned (always 1) |
resources | array[Dataset] | Array containing the requested dataset |
errors | array[Error] | Any errors encountered |
Dataset Object Structure
| Field | Type | Description |
|---|---|---|
id | string | Unique dataset identifier |
version | string | Version identifier for concurrency control |
name | string | Human-readable name |
description | string | Detailed description |
sensitivity | string | Sensitivity level (unrestricted, low, moderate, high, critical) |
domain | string | Domain category (corporate, partner, personal, public) |
query | object | Query conditions defining dataset membership |
created_at | string | Creation timestamp (ISO 8601) |
last_modified | string | Last modification timestamp (ISO 8601) |
backfills_enabled | boolean | Whether historical data processing is enabled |
policy_ids | array[string] | Associated policy identifiers |
Rate Limit
- 60 requests per minute per API key
- 5 concurrent requests per endpoint
Example Response
{
"type": "datasets",
"size": 1,
"resources": [
{
"id": "dataset-123",
"version": "v1.2.3",
"name": "Customer PII",
"description": "Personal identifiable information of customers",
"sensitivity": "high",
"domain": "corporate",
"query": {
"rules": [
{
"conditions": [
{
"field_name": "content_tags",
"operator": "has_tag",
"values": [{"value": "customer_pii"}]
}
]
}
]
},
"created_at": "2024-01-15T10:30:00Z",
"last_modified": "2024-01-20T14:15:00Z",
"backfills_enabled": true,
"policy_ids": ["policy-456", "policy-789"]
}
],
"errors": []
}