Complete the following:
Open RStudio and use File -> Recent Projects
to
select and open the R Project which is managing your GitHub
repository.
Use File -> Open File...
and navigate to the
location of your R Markdown notebook from Competition Assignment 2. Open
it.
Now that you’ve learned about techniques for extracting signals from
text features, its time to revisit the description
feature
in your data set.
Head back to the EDA section of your notebook and apply what you’ve learned about text analysis to your training data. Refer back to our recent notebooks about any functionality you’ll want to import or utilize.
Once you are satisfied with your EDA, head over to your Model
Construction and Interpretation section. See if you can improve
upon your existing models by extracting text-based features from the
description
feature.
Use your new model to make predictions for the properties in
comp.csv
. Submit those predictions to our class Kaggle
competition for scoring.
When you are done, use the blue ball of yarn button to knit the notebook into an HTML document.
Use the Git
tab in the top right pane of RStudio to
Pull, Commit, Push your new files to your remote repository at
GitHub.
Stop by my office if you have any questions or need help.