Metrics¶
Amount of Content¶
Number of Contributors¶
A contributor is a subject that is identified as an author of a dataset.
Authors can be:
The subject that added the content to DataONE. This is expressed in the system metadata as the
submitter
. However, the submitter may not be the author or creator of the content. For example, a data manager who had no involvement in the creation of the dataset may have uploaded the content.The authors identified in the science metadata associated with the dataset. The DataONE indexing service extracts this information from the metadata and populates the
author
and related fields of the search index.
Submission Contributors¶
Submission contributors is expressed as a single system metadata property which is indexed to the submitter
field.
The unique values of submitter
can be determined by examining the solr index with a facet query:
https://cn.dataone.org/cn/v2/query/solr/
?q=*%3A*
&facet=on
&rows=0
&facet.limit=-1
&facet.field=investigator
which can be processed with using xmlstarlet
and wc
as small bash script to show the number of unique values:
1#!/bin/bash
2
3SOLR_FIELD="${1}"
4URL="https://cn.dataone.org/cn/v2/query/solr/?q=*%3A*"
5URL="${URL}&facet=on&rows=0&facet.limit=-1"
6URL="${URL}&facet.field=${SOLR_FIELD}"
7echo "URL: ${URL}"
8curl -s "${URL}" | xml sel -t -m "//lst/lst/int" -v "@name" -n | wc -l
As of this writing, the number of unique submitters
is:
./field_count submiter
16340