Learning R Featured R for Data Science Exercises: Web Scraping Web scraping involves programmatically extracting data from websites.
Learning R Featured R for Data Science Exercises: Joining Data Frames Joining data frames is a crucial data manipulation technique used to combine two or more data frames based on common columns or keys. This process allows you to merge datasets to create a more comprehensive and detailed dataset that can be used for further analysis.
Learning R Featured R for Data Science Exercises: Missing Values Missing values can occur for various reasons, such as data entry errors, equipment malfunctions, or respondents choosing not to answer certain survey questions. Properly handling these missing values is crucial to ensure the integrity and accuracy of your analyses.
Learning R Featured R for Data Science Exercises: Dates and Times Dates and times are hard because they have to reconcile two physical phenomena (the rotation of the Earth and its orbit around the sun) with a whole raft of geopolitical phenomena including months, time zones, and DST.
Learning R Featured R for Data Science Exercises: Factors Factors are a type of variable in R that are used to store categorical data. Factors can be ordered or unordered. Factors are stored as integers, and have labels associated with these unique integers.
Learning R Featured R for Data Science Exercises: Regular Expressions Regular Expressions, A.K.A., "regex" or "regexp" are a concise and powerful language for describing patterns within strings. With every punctuation character potentially overloaded with meaning, regular expressions are one of the most compact languages out there.
Learning R Featured R for Data Science Exercises: Strings Strings are not glamorous, high-profile components of R, but they do play a big role in many data cleaning and preparation tasks. R provides a solid set of string operations, but because they have grown organically over time, they can be inconsistent and a little hard to learn.
Learning R Featured R for Data Science Exercises: Communication Now that we understand your data, we need to communicate this understanding to others. To help others quickly build up a good mental model of the data, you will need to be as self-explanatory as possible. This chapter focuses on the tools you need to create good graphics.
Learning R Featured R for Data Science Exercises: Exploratory Data Analysis The goal during EDA is to develop an understanding of your data using questions as tools to guide your exploration: 1. What type of variation occurs within my variables?; 2. What type of covariation occurs between my variables?
Learning R Featured R for Data Science Exercises: Visualise Layers This section is based on the layered grammar of visualisations, facets, statistics, position adjustments, and coordinate systems to give you a fundamental understanding of plotting your data.
Learning R Featured R for Data Science Exercises: Workflow Scripts and Projects and Data Import The final installment of exercises from the "Whole game" section of the R4DS 2nd edition. These exercises focus on becoming more familiar and efficient in R studio.
Learning R Featured R for Data Science Exercises: Data Transformation Case Study In this case study, we will be comparing how often a player successfully hits the ball (H) to the total number of attempts they made to hit the ball (AB). Including a count ensures our analysis is based on a reasonable amount of data and not just a few instances.
Learning R Featured R for Data Science Exercises: Workflow Code Style and Data Tidying These exercises are focussed on furthering your workflow coding style and tidying messy data to make analysis easier.
Learning R Featured R for Data Science Exercises: Data Transformation These exercises are an introduction to the data transformation basics of R, including sorting, filtering, grouping, modifying, summarising, relocating and selecting data.
Learning R Featured R for Data Science Exercises: Workflow Basics These exercises are an introduction to the workflow basics of R, including being mindful of typos in code and keyboard shortcuts for faster coding!
Learning R Featured R for Data Science Exercises: ggplot2 These exercises are an introduction to the ggplot2 package used to visualise data in R.
Learning R Featured R for Data Science Exercises: Introduction The beginning of a series in which I'm working through the 'R for Data Science 2nd Edition' Exercises to learn R. Follow along if you're interested!