Thanh-Tuyen Nguyen-Tran

Logo

A corporate lawyer turned student of data and neuroscience.

LinkedIn

View My GitHub Profile

Exploring the crunchbase datasets using SQL

Project description

In this project, I explored the data from Crunchbase, the crowdsourced platform which lists startups, investors and their people.

The dataset was extracted via Kaggle on 17 November 2020. It starts in 1960 and is current as of October 2013 (before Crunchbase switched to its current paid API model). It includes all the entities, their people as well as funding roundsm acquisitions and IPOs. There are 11 tables in total that can be joined using unique IDs.

Objectives

The idea is to have a general idea of the investment scene and find out some characteristics.

This is a first and general analysis focused on these 3 questions:

  1. What type of funding is the most frequent through time? What type of funding does raise the highest amount?
  2. What kind of categories/industries attracted the highest amount of investment?
  3. Where can we look for new entrepreneurs/startups?

Results

This is the interactive dashboard with the end results after analysis using SQL.

Dashboard

For more details on the SQL analysis see this repository.

Back to the portfolio