Written by

Bernard Marr

Bernard Marr is a world-renowned futurist, influencer and thought leader in the fields of business and technology, with a passion for using technology for the good of humanity. He is a best-selling author of over 20 books, writes a regular column for Forbes and advises and coaches many of the world’s best-known organisations. He has a combined following of 4 million people across his social media channels and newsletters and was ranked by LinkedIn as one of the top 5 business influencers in the world.

Bernard’s latest books are ‘Future Skills’, ‘The Future Internet’, ‘Business Trends in Practice’ and ‘Generative AI in Practice’.

Generative AI Book Launch
View My Latest Books

Follow Me

Bernard Marr ist ein weltbekannter Futurist, Influencer und Vordenker in den Bereichen Wirtschaft und Technologie mit einer Leidenschaft für den Einsatz von Technologie zum Wohle der Menschheit. Er ist Bestsellerautor von 20 Büchern, schreibt eine regelmäßige Kolumne für Forbes und berät und coacht viele der weltweit bekanntesten Organisationen. Er hat über 2 Millionen Social-Media-Follower, 1 Million Newsletter-Abonnenten und wurde von LinkedIn als einer der Top-5-Business-Influencer der Welt und von Xing als Top Mind 2021 ausgezeichnet.

Bernards neueste Bücher sind ‘Künstliche Intelligenz im Unternehmen: Innovative Anwendungen in 50 Erfolgreichen Unternehmen’

View Latest Book

Follow Me

Big Data And AI: 30 Amazing (And Free) Public Data Sources

2 July 2021

Machine learning, artificial intelligence, blockchains, predictive analytics – all amazing technologies which have promised to revolutionise business and society.

They are useless, however, without data. Fortunately for businesses and organisations which don’t have the resources to methodically collect every piece of useful information, they will need themselves, a huge (and growing) amount is available freely online.

Two years ago I wrote an article listing 33 sources of Big Data available for free online. Of course, in business technology terms that was a lifetime ago, so here’s an update with thirty new entries:

1. World Bank Open Data Datasets covering population demographics and a huge number of economic and development indicators from across the world.

2. IMF Data The International Monetary Fund publishes data on international finances, debt rates, foreign exchange reserves, commodity prices and investments.

3. The US National centre for Education Statistics Data on educational institutions and education demographics from the US and around the world.

4. The UK Data Centre The UK’s largest collection of social, economic and population data.

5. FiveThirtyEight A large number of polls providing data on public opinion of political and sporting issues.

6.FBI Uniform Crime Reporting The FBI is responsible for compiling and publishing national crime statistics, with free data available at national, state and county level.

7. Bureau of Justice Here you can find data on law enforcement agencies, gaols, parole and probation agencies and courts.

8. Qlick Data Market Offers a free package with access to datasets covering world population, currencies, development indicators and weather data.

9. NASA Exoplanet Archive Public datasets covering planets and stars gathered by NASA’s space exploration missions.

10. UN Comtrade Database Statistics compiled and published by the United Nations on international trade. Includes Comtrade Lab which is a showcase of how cutting edge analytics and tools are used to extract value from the data.

11. Financial Times Market Data Up to date information on financial markets from around the world, including stock price indexes, commodities and foreign exchange.

12. Google Trends Examine and analyse data on internet search activity and trending news stories around the world.

13. Twitter The advantage Twitter has over the others are that most conversations are public. This means that huge amounts of data is available through their API on who is talking about what, where, when and why.

14. Google Scholar Entire texts of academic papers, journals, books and legal case law.

15. Instagram As with Twitter, Instagram posts and conversations are public by default. Their APIs allow likes, mentions and business details to be analysed.

16. OpenCorporates The world’s largest open database of companies.

17. Glassdoor API Information about job vacancies, candidates, salaries and employee satisfaction is available through their developer API.

18. IMDB Datasets Datasets in a number of formats drawn from the web’s largest resource on movies, television and people working in those industries.

19. OpenLibrary Data Dumps Datasets on books including catalogues from libraries around the world

20. Labelled Faces in the Wild 13,000 collated and labelledimages of human faces, for use in developing applications involving facial recognition.

21. Microsoft Marco Microsoft’s open machine learning datasets for training systems in reading comprehension and question answering.

22. Machine Learning Dataset Repository Collection of open datasets contributed by data scientists involved in machine learning projects.

23. eBay Market Data Insights Data on millions of online sales and auctions from eBay

24. Natural History Museum Data Portal Information on nearly 4 million historical specimens in the London museum’s collection, as well as scientific sound recordings of the natural world.

25. CERN Open Data More than one petabyte of data from particle physics experiments carried out by CERN.

26. One Million Audio Cover Images Dataset hosted at archive.org covering music released around the world, for use in image processing research

27. Complete Public Reddit Comments Corpus Over one billion public comments posted to Reddit between 2007 and 2015, for training language algorithms

28. Microsoft Azure Data Markets Free Datasets Freely available datasets covering everything from agriculture to weather

29. Irish Electric Vehicle Charge Point Status Collates data from the body which oversees the network of EV charge points across the Republic of Ireland and Northern Ireland.

30. LondonAir Pollution and air quality data from across London

I hope these sources are useful and as new ones become available every day I will be updating this list on a regular basis, so stay connected.

Business Trends In Practice | Bernard Marr
Business Trends In Practice | Bernard Marr

Related Articles

The Future Of Medicine: How AI is Shaping Patient Care And Drug Discovery

One of the most exciting aspects of AI is its implications for healthcare. Today, doctors and other medical professionals routinely augment their human skills and experience with the help of intelligent machines.[...]

Navigating The Future: 10 Global Trends That Will Define 2024

We’re approaching the mid-point of a decade in which we’ve already seen significant global transformation.[...]

Unlocking The Future Of Learning: How XR Tech Transforms Education

In the metaverse era, education as we know it will change. And I’m not just talking about formal education in schools, colleges, and universities – but also workplace learning and lifelong learning.[...]

2024 IoT And Smart Device Trends: What You Need to Know For The Future

By the end of 2024, there are projected to be more than 207 billion devices connected to the worldwide network of tools, toys, devices and appliances that make up the Internet of Things (IoT).[...]

The Evolving Internet: Navigating Risks Amidst Immersion, Decentralization, And Generative AI

The future internet is on the horizon, promising unprecedented engagement and innovation. Yet, as we incorporate immersive tech, decentralized systems, and generative AI, we also invite new complexities.[...]

The 8 Biggest Future Of Work Trends In 2024 Everyone Needs To Be Ready For Now

The world of work is constantly changing. Concepts that our parents or grandparents grew up with, such as the nine-to-five office and the job-for-life, are being consigned to the past.[...]

Sign up to Stay in Touch!

Bernard Marr is a world-renowned futurist, influencer and thought leader in the fields of business and technology, with a passion for using technology for the good of humanity.

He is a best-selling author of over 20 books, writes a regular column for Forbes and advises and coaches many of the world’s best-known organisations.

He has a combined following of 4 million people across his social media channels and newsletters and was ranked by LinkedIn as one of the top 5 business influencers in the world.

Bernard’s latest book is ‘Generative AI in Practice’.

Sign Up Today

Social Media

0
Followers
0
Followers
0
Followers
0
Subscribers
0
Followers
0
Subscribers
0
Yearly Views
0
Readers

Podcasts

View Podcasts