Conferences >2016 2nd International Confer...

Characteristics of Open Data CSV Files

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

This work analyzes an Open Data corpus containing 200K tabular resources with a total file size of 413 GB from a data consumer perspective. Our study shows that ~10% of t...Show More

Metadata

Abstract:

This work analyzes an Open Data corpus containing 200K tabular resources with a total file size of 413 GB from a data consumer perspective. Our study shows that ~10% of the resources in Open Data portals are labelled as a tabular data of which only 50% can be considered CSV files. The study inspects the general shape of these tabular data, reports on column and row distribution, analyses the availability of (multiple) header rows and if a file contains multiple tables. In addition, we inspect and analyze the table column types, detect missing values and report about the distribution of the values.

Published in: 2016 2nd International Conference on Open and Big Data (OBD)

Date of Conference: 22-24 August 2016

Date Added to IEEE Xplore: 22 September 2016

ISBN Information:

DOI: 10.1109/OBD.2016.18

Conference Location: Vienna, Austria

Contents

References is not available for this document.

Characteristics of Open Data CSV Files

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Characteristics of Open Data CSV Files

Alerts

Abstract:

Metadata

Abstract:

Authors

Figures

References

Citations

Keywords

Metrics

Footnotes

References

IEEE Account

Purchase Details

Profile Information

Need Help?