Access and download the Charity dataset at https://www.kaggle.com/katyjqian/charity-navigator-scores-expenses-dataset
Load the data into R (or Python, etc)
(a) Plot a histogram of “Funding Efficiency in $ (amount spent to raise $1 in donations)” – it’s the variable fund_eff
(b) Is this a continuous or discrete random variable?
(c) What is the theoretical range of this random variable? What is its observed range?
(d) What are its mean and standard deviation?
(e) Present 5 different histogram versions (vary the bin size, number of bins)
(f) Comment on the shapes of these histograms. Do they tell a similar shape story?
(g) Present one histogram (your favorite of the bunch) with a smooth (kernel) density superimposed to do that in R, you can use the command lines( density(fund_eff) )
(h) Among all the densities and pmfs we’ve learned about, pick one that you think most closely resembles the shape of the histogram.
Comments
Leave a comment