1. The total number of missing values within the dataset.
2. Which column / feature requires correction in the type of value they hold?
3. After imputation of nulls with mean what is the average value of the compressive strength in concrete?
4. The feature that has a moderately strong relationship with compressive strength in concrete is?
5. Standardize the dataset using standardscaler(), split the dataset into train and test of proportions 70:30 and set the random state to 1. Build a Linear Regression Model on the data and the resulting r-squared value is between which range?
# Importing libraries
import pandas as pd
import numpy as np
# Read csv file into a pandas dataframe
df = pd.read_csv("property data.csv")
# Take a look at the first few rows
print df.head()
Comments
Leave a comment