Find one or two datasets from Kaggle or any other source.
Learning Goal: I’m working on a machine learning question and need an explanation and answer to help me learn.
Use the Spark machine learning library (MLlib) to complete this phase of the project.
1. [2 pts] Find one or two datasets from Kaggle or any other source. Make sure that each dataset is at least one GB.
2. [2 pts] Write a detailed description of each dataset.
3. [6 pts] Preprocess each dataset.
4. [2 pts] Divide each dataset into training and testing sets.
5. [12 pts] Build two regression models.
6. [4 pts] Test the models and compute their accuracy.
Deliverable: One Word file that contains the solution to each of the above questions: