PleaseFundThis.com is a web site that allows users to create projects in order to obtain
crowd-funding for creative pursuits such as such as films, music, stage shows, comics,
journalism, video games… and so on. Each project seeks monetary pledges from people.
Most often projects are offered tangible rewards and one-of-a-kind experiences in
exchange for their pledges.
Data from PleaseFundThis has been gathered and is available in the file
The columns within the spreadsheet are:
Penny Robinson is a very talented young woman. She can act, sing, write, paint etc. In
fact she can do anything creative.
Penny’s only problem is that she doesn’t have any money. She wants to obtain money
via crowd-funding to fund a creative project. She knows that you have a spreadsheet
full of data. She want to know if you can give her any advice about creating a crowdfunded
project based on the data if the file. E.g. what will make her project more likely to succeed?
Your tasks is to
Explore the data. Find information that you think will be useful to Penny (and to
other people who want to create similar crowd-funding projects).
Attempt to determine which (if any) what are the most important attributes of
the project in terms of whether a projects succeeds (meets the required funds)
List and present the most important attributes and justify why this is so (w
use of BigML screen dumps and supporting explanation / discussion).
You will need to do this based around two models.
In the first model, use any / all of the columns in the dataset. However some columns
probably aren’t suitable for analysis. BigML may automatically choose some column not
to be suitable (e.g. Project Id – as every row has a unique value, it provides no analytical
use). In other case, you may need to select which columns are not suitable for analysis.
In the second model, you must exclude these columns: percent_raised,
amt_pledged_$, avg_amt$_per_pledge, project_state, number_of_pledgers &
Suggestion. Save the Excel file as a CSV file before loading into BigML. This generally
allows column headings to be used field names rather than ‘field1’, ‘field2’ …