# Report the best performance on the test set (in terms of MSE)

Learning Goal: I’m working on a machine learning exercise and need an explanation and answer to help me learn.For this assignment, you need to build a linear regression model from scratch. Below is a detailed instruction of what you may need to do. Dataset PreparationYou need to load the dataset using sklearn.datasets. load_diabetes. More information about the function can be found at: https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_diabetes.html#sklearn.datasets.load_diabetes
After loading the dataset, randomly shuffle the dataset to split the dataset to train/dev/test sets.Use the 70% of data for the train set, 15% for the dev set, and 15% for the test set
You need to make sure that the labels and features are still matching after shuffling the data.
You may want to use the random shuffle function provided by Numpy.
Univariate Linear Regression DevelopmentYou need to implement a univariate linear regression model from scratch.You need to use a gradient descent algorithm to solve the optimal parameters for the univariate linear regression model.
You need to implement the gradient descent algorithm from scratch.
The dataset contains 10 features; however for a univariate linear regression model, you may only use one feature. Thus, to build the linear regression model, you need to decide which feature to you. There are three approaches you may use:You may train 10 univariate linear regression models, one model for each feature, and select the one with the best performance on the dev set.
You may run a feature selection algorithm to select a feature. There are plenty of feature selection algorithms available. Feel free to search on the internet and use a method at your choice.
You may use PCA for dimension reduction to reduce the number of features to 1. There are many python libraries that provide PCA algorithms you may use. In fact, scikit-learn also has a function. Feel free to search on the internet and choose one at your choice.
Test the ModelTest the model using the test set.
SubmissionYou need to submit a written report for this assignment. For this report, you need to: Explain what you have done
Report the best performance on the test set (in terms of MSE)
Include your code as an appendixYou could save your Colab code as a PDF file and attach it to your report, or you could copy and paste your code into the report.If you want to copy/paste your code, make sure to maintain the appropriate indentation and make the code readable.
Evaluation Criteria: This assignment is worth 100 points. (50 pts) The code must be runnable and works as expected
(50 pts) The report must contain the three components
Requirements: elaborated   |   .doc file

Pages (275 words)
Standard price: \$0.00
Client Reviews
4.9
Sitejabber
4.6
Trustpilot
4.8
Our Guarantees
100% Confidentiality
Information about customers is confidential and never disclosed to third parties.
Original Writing
We complete all papers from scratch. You can get a plagiarism report.
Timely Delivery
No missed deadlines – 97% of assignments are completed in time.
Money Back