Data

Easy Ways to Get and Create Datasets

Data Analytics is very much about data, and you need to get them from somewhere. Your organization’s systems are full of them, but for comparison and for telling a bigger story, external or self-made datasets may help.

Datasets and Tools for Capturing and Generating Data

Publicly available datasets, and methods, tools, and techniques for creating your own sets.

Table of share prices and related information from a newspaper

Datasets

Many datasets are available for data analysts, and they cover all kinds of topics. The best thing is that almost everything is free to download and use.

Hundreds of thousands of available datasets can definitely help you find and show valuable insights, also for comparing/benchmarking with your own company’s internal data.

The links are sorted after their access structure, showing which datasets are available for free, and which require some kind of payment or permission. And no matter if the data are free or not, there might be some kind of restriction on their use, if only to use a proper citation, so check each site and dataset carefully when downloading.

Active, Updated Data Sources

Providers of current data for research, benchmarking, or other real life purposes in business or research contexts.


  • Brigh Data Dataset Marketplace
    Scraped data from popular websites with products, reviews, etc.
  • Datarade
    Online dataset marketplace, business-relevant categories
  • DB IP
    IP-address based geolocation database and API, for web analytics, with paid subscriptions and free data available
  • Makersite
    Deep-tier supply chain data
  • Nasdaq Data Link
    Finance and economy datasets, of which some are free, requires login
  • Techsalerator
    Business datasets
  • UK Data Services
    Public data from the UK, most are “safeguarded” and require a university login

Historical Data

Older datasets, or sets about historical topics. Useful mostly for comparisons over time, or for general research.


Demo and Educational

Datasets used for practicing data analytics, or for showing the features of tools. Usually not used for commercial applications.


  • Coming soon

Tools for Capturing and Generating Data

Even if there are hundreds of thousands of available datasets for download and use, you may need to work with a different set of data made specifically for your project.

Different tools, methods, and services, exist to help you generating your datasets. The most simple, of course, is to just key in some data in a spreadsheet or other list, but many other ways exist.

Copy, Extract, Scrape

Getting data from existing databases or other sources.


  • data.world
    Online data catalog with workbench for teamwork on metadata management, possible to query and copy from public datasets
  • Coming soon

Calculate, Create, Measure

Getting data from sensors, manual or automatic generation methods.


  • Coming soon
  • Generatedata.com
    Generate test data in various formats
  • Mockaroo
    Generate test data in various formats
  • SAS DataMaker
    Synthetic data generator
  • Syntho
    Software for synthetic data creation, data management, and personal information masking

Interview, Observe, Survey

Asking or monitoring people, animals, or the universe.


  • Coming soon
  • Coming soon