Skip to content
This repository was archived by the owner on Jun 18, 2024. It is now read-only.

Commit d9e64e5

Browse files
author
Rebecca Williams
committed
Merge pull request #491 from project-open-data/collectionsaddition
First pass at collections guidance
2 parents 8ad5f09 + 63e772e commit d9e64e5

File tree

1 file changed

+76
-0
lines changed

1 file changed

+76
-0
lines changed

v1.1/collections.md

Lines changed: 76 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,76 @@
1+
---
2+
published: true
3+
layout: default
4+
title: Data Collections
5+
permalink: /v1.1/collections/
6+
filename: collections.md
7+
---
8+
9+
Agencies should enrich their Enterprise Data Inventory and Public Data Listing data.json files by ensuring all data assets (including databases and web applications) include metadata for the individual datasets contained within. Including metadata for the individual datasets containted within a larger data asset makes that data more discoverable improving searchability, facilitating use, and providing a comprehensive reference for data management and coordination. The original v1.0 schema did not accommodate data collections, but with the [v1.1 schema](/v1.1/schema/), datasets that belong to a larger collection of data should now use the `isPartOf` field to reference the `identifier` of a parent dataset. The parent dataset acts as a container for all the datasets in the collection so it may not have any distributions associated with it, but it can also be used to list distributions that provide consolidated or merged versions of the datasets in the collection for bulk download.
10+
11+
12+
Project Open Data Standard Metadata Usage Notes for Data Collections
13+
--------------------------------------------------
14+
15+
{: .table .table-striped #identifier}
16+
**Field [#](#identifier){: .permalink}** | **identifier**
17+
----- | -----
18+
**Cardinality** | (1,1)
19+
**Required** | Yes, always
20+
**Accepted Values** | String
21+
**Usage Notes** | This field allows third parties to maintain a consistent record for datasets even if title or URLs are updated. Agencies may integrate an existing system for maintaining unique identifiers. Each identifier **must** be unique across the agency's catalog and remain fixed. It is **highly recommended** that a [URI (preferably an HTTP URL)](http://tools.ietf.org/html/rfc3986#section-1.1.3) be used to provide a globally unique identifier. Identifier URLs should be designed and maintained to persist indefinitely regardless of whether the URL of the resource itself changes.
22+
**Example** | `{"identifier":"http://dx.doi.org/10.7927/H4PZ56R2"}`
23+
24+
{: .table .table-striped #isPartOf}
25+
**Field [#](#isPartOf){: .permalink}** | **isPartOf**
26+
----- | -----
27+
**Cardinality** | (0,1)
28+
**Required** | No
29+
**Accepted Values** | String
30+
**Usage Notes** | This field allows the grouping of multiple datasets into a "collection". This field should be employed by the individual datasets that together make up a collection. The value for this field should match the `identifier` of the parent dataset.
31+
**Example** | `{"isPartOf":"http://dx.doi.org/10.7927/H4PZ56R2"}`
32+
33+
34+
Example of a data asset collection in a data.json file
35+
--------------------------------------------------
36+
~~~~
37+
{
38+
"@type": "dcat:Dataset",
39+
"title": "Agency Financial Report, 2013",
40+
"accessLevel": "public",
41+
"contactPoint": {
42+
"@type": "vcard:Contact",
43+
"fn": "William Joe",
44+
"hasEmail": "mailto:[email protected]"
45+
},
46+
"description": "Annual financial reports by the U.S. Office of Personnel Management (OPM). The reports provide OPM's audited financial statements, notes, a management discussion and analysis, and other information. The reports also help Congress, the President, and the public assess OPM's stewardship over financial resources.",
47+
"identifier": "OPM-CFO-0007-2013",
48+
"isPartOf": "OPM-CFO-0007",
49+
"keyword": [
50+
"2013",
51+
"financial reporting",
52+
"financial statements",
53+
"management discussion and analysis"
54+
],
55+
"modified": "2013-09-30",
56+
"publisher": {
57+
"@type": "org:Organization",
58+
"name": "U.S. Office of Personnel Management"
59+
},
60+
"temporal": "2012-10-01/2013-09-30",
61+
"distribution": [
62+
{
63+
"@type": "dcat:Distribution",
64+
"downloadURL": "http://www.opm.gov/about-us/budget-performance/performance/2013-agency-financial-report.pdf",
65+
"mediaType": "application/pdf",
66+
"title": "2013-agency-financial-report.pdf"
67+
}
68+
],
69+
"bureauCode": [
70+
"027:00"
71+
],
72+
"programCode": [
73+
"027:000"
74+
]
75+
},
76+
~~~~

0 commit comments

Comments
 (0)