ASP 460 2.0 Special Topics in Statistics

class: center, middle, inverse, title-slide

# ASP 460 2.0 Special Topics in Statistics
## Coordinate Systems and Axes
### Thiyanga Talagala
### 2020-05-19

---

# Introduction

- Position scales: How to place data points in the plot.

- Arrangement of axes

- horozontal x-axis and a vertical y-axis.

- y-axis run at an acute angle relative to the x-axis.
     
     - one axis run in a circle and the other run radially.

## Definition: coordinate systems

"The combination of a set of position scales and their relative geometric arrangement is called a coordinate system." (Claus O. Wilke)

---

# Cartesian Coordinates

## 2D Cartesian coordinate system

- Two orthogonal axes (horizontal x-axis, vertical y-axis).

- If X and Y axes are measured in the same units, the grid spacing for the two axes should be equal. The plot area should be a perfect square.

.pull-left[

```r
library(ggplot2)
ggplot(data=iris, 
       aes(x=Sepal.Length, 
           y=Sepal.Width))+
  geom_point()+
*  theme(aspect.ratio = 1)
```
]

.pull-right[
![](lecture7_files/figure-html/unnamed-chunk-2-1.png)

]

---
# 2D Cartesian coordinate system (cont.)

Cartesian coordinate systems are invariant under linear transformations.

.pull-left[

```r
library(ggplot2)
ggplot(data=iris, 
       aes(x=Sepal.Length, 
           y=Sepal.Width))+
  geom_point()+theme(aspect.ratio = 1)
```

![](lecture7_files/figure-html/unnamed-chunk-3-1.png)
]
.pull-right[

```r
library(ggplot2)
ggplot(data=iris, 
       aes(x=Sepal.Length*100, 
           y=Sepal.Width*100))+
  geom_point()+theme(aspect.ratio = 1)
```

![](lecture7_files/figure-html/unnamed-chunk-4-1.png)

]

---
# 2D Cartesian coordinate system (cont)

When two different types of variables are mapped to `$X$` and `$Y$` the plotting area can be stretched or compressed one relative to the other to maintain a valid visualization of the data.

Example: X-Temperature, Y-Date

![](lecture7_files/figure-html/unnamed-chunk-5-1.png)

---

# 2D Cartesian coordinate system (cont)

.pull-left[

**Square grid**

![](lecture7_files/figure-html/unnamed-chunk-6-1.png)

]

.pull-right[

**Rectangular grid**

]

---
# Nonlinear Axes

```r
x <- c(1, 3.2, 12, 35, 100, 1000); x
```

```
[1]    1.0    3.2   12.0   35.0  100.0 1000.0
```

```r
log_10_x <- log(x, base=10); log_10_x
```

```
[1] 0.000000 0.505150 1.079181 1.544068 2.000000 3.000000
```

![](lecture7_files/figure-html/unnamed-chunk-10-1.png)

---

# Nonlinear Axes (Cont.)

.pull-left[
![](lecture7_files/figure-html/unnamed-chunk-11-1.png)

]

.pull-right[

![](lecture7_files/figure-html/unnamed-chunk-13-1.png)
]
- Frist 3 plots are correct. Fourth graph is incorrect.

- No difference between plotting the log-transformed data on a
linear scale or plotting the original data on a logarithmic scale. The only difference lies in labelling values.

- First two are preferable.

- Third graph: high mental burden when reading the graph.

---

# Transformations

Both logarithm scale and square-root scale compresses larger range into a smaller range.

- Logarithmic scales

> When the dataset contains wide range of values.

- Square-root scale

> When data contains 0, use square-root transformation.

---

# Logarithmic scales

Logarithmic scales can emphasize the rate of change in a way that linear scales do not. Italy seems to be slowing the corona virus infection rate, while the number of cases in Spain continues to double every few days.

.pull-left[
![](lecture7_files/figure-html/unnamed-chunk-14-1.png)
]

.pull-right[
![](lecture7_files/figure-html/unnamed-chunk-15-1.png)
]

---

# Coordinate Systems with Curved Axes

We specify positions through **angles** and **radial distance** from the origin.

.pull-left[

**1. Cartesian coordinate system**

![](lecture7_files/figure-html/unnamed-chunk-16-1.png)

]

.pull-right[

**2. Polar coordinate system**

![](lecture7_files/figure-html/unnamed-chunk-17-1.png)

]

We have taken the x coordinates from (1) and used them
as angular coordinates and the y coordinates from part (1) and used them as radial
coordinates. The circular axis runs from 0 to 10 in this example, and therefore x = 0 and
x = 10 are the same locations in this coordinate system.
---

## Coordinate Systems with Curved Axes (cont.)

![](angular_coordinate_on_graph.jpg)

A Polar chart draws the x and y coordinates in each series as ($\theta$,$r$), where `$\theta$` is amount of rotation from the origin and `$r$` is the distance from the origin.
---

## Coordinate Systems with Curved Axes (cont.)

**Monthly anti-diabetic drug sales in Australia from 1991 to 2008.**

```r
autoplot(fpp2::qcement)
```

![](lecture7_files/figure-html/unnamed-chunk-18-1.png)

---
## Coordinate Systems with Curved Axes (cont.)

### Seasonal plots

- Emphasize seasonal patterns and show changes in these patterns over time.

- Seasonal plot is similar to a time plot except that the data are plotted against the individual “seasons” in which the data were observed.

- Both plots illustrate a sharp decrease in values in February.

- Notice the colour scale!!!

.pull-left[

![](lecture7_files/figure-html/unnamed-chunk-19-1.png)

]

.pull-right[

![](lecture7_files/figure-html/unnamed-chunk-20-1.png)

]

---
## Coordinate Systems with Curved Axes (cont.)

.pull-left[

![](lecture7_files/figure-html/unnamed-chunk-21-1.png)![](lecture7_files/figure-html/unnamed-chunk-21-2.png)

]

.pull-right[

- Polar coordinates are useful for data of a periodic nature (values at
one end of the scale can be meaningfully joined to data values at the other end).

- The polar version
shows how similar the sales are in December and January. In the Cartesian coordinate system, this fact is
obscured because the values in December and in January are
shown in opposite parts of the graph.
]

---

## Coordinate Systems with Curved Axes (cont.)

.pull-left[

![](lecture7_files/figure-html/unnamed-chunk-22-1.png)![](lecture7_files/figure-html/unnamed-chunk-22-2.png)

]

.pull-right[

- What is the angular coordinate?

- What is the radial coordinate?
]
---
# Coordinate systems in ggplot2

1) `coord_cartesian()`: Zooming into a plot

```r
base1 <- ggplot(iris, aes(Sepal.Length, Sepal.Width)) + 
  geom_point() + 
  geom_smooth()+ggtitle("base1")
# Scaling to 5--7 throws away data outside that range
base2 <- base1 + scale_x_continuous(limits = c(5, 7))+ggtitle("base2")
# Zooming to 5--7 keeps all the data but only shows some of it
base3 <- base2 + coord_cartesian(xlim = c(5, 7))+ggtitle("base3")
base4 <- base2 + coord_cartesian(ylim = c(2, 3.5))+ggtitle("base4")
library(patchwork)
base1|base2|base3|base4
```

![](lecture7_files/figure-html/unnamed-chunk-23-1.png)

---
# Coordinate systems in ggplot2 (cont.)

2) `coord_flip()`:  Flipping the axes

```r
base1 <- ggplot(iris, aes(x=Sepal.Length, y=Sepal.Width)) + 
  geom_point() + geom_smooth()+ggtitle("base1")
# curve is fitted to the original data, and then rotates
base2 <- ggplot(iris, aes(x=Sepal.Length, y=Sepal.Width)) + 
  geom_point() + geom_smooth()+
  coord_flip()+ggtitle("base2")
# curve is fit to the rotated data
base3 <- ggplot(iris, aes(x=Sepal.Width, y=Sepal.Length)) + 
  geom_point() + geom_smooth()+ggtitle("base3")
base1|base2|base3
```

![](lecture7_files/figure-html/unnamed-chunk-24-1.png)

---
# Coordinate systems in ggplot2 (cont.)

3) `coord_fixed()`: Equal scales

A fixed scale coordinate system forces a specified ratio between the physical representation of data units on the axes.

**ratio:**	aspect ratio, expressed as `$\frac{y}{x}$`

```r
p <- ggplot(mtcars, aes(mpg, wt)) + geom_point()
```

.pull-left[

```r
p + coord_fixed(ratio = 1)
```

![](lecture7_files/figure-html/unnamed-chunk-26-1.png)
]

.pull-right[

```r
p + coord_fixed(ratio = 5)
```

![](lecture7_files/figure-html/unnamed-chunk-27-1.png)
]

---
# Non-linear coordinate with ggplot2

```r
#Acknowledgement- ggplot2: Elegant Graphics for Data Analysis Hadley Wickham
rect <- data.frame(x = 50, y = 50)
line <- data.frame(x = c(1, 200), y = c(100, 1))
base <- ggplot(mapping = aes(x, y)) + 
  geom_tile(data = rect, aes(width = 50, height = 50)) + 
  geom_line(data = line) + 
  xlab(NULL) + ylab(NULL)
base1 <- base + ggtitle("base1")
base2 <- base + coord_polar("x") + ggtitle("base2 (line 50-60)")
base3 <- base + coord_polar("y") + ggtitle("base3 (line:  60-75)")
base1|base2|base3
```

![](lecture7_files/figure-html/unnamed-chunk-28-1.png)

---

## Pie Chart vs Bar Chart

The polar coordinates gives rise to pie charts, radar charts, etc.

### sample dataset

.pull-left[

```r
# Load ggplot2
library(ggplot2)
library(dplyr)

# Create Data
data <- data.frame(
  group=LETTERS[1:5],
  value=c(13,7,9,21,2)
)

data
```

```
  group value
1     A    13
2     B     7
3     C     9
4     D    21
5     E     2
```
]

.pull-right[

```r
# Compute the position of labels
data <- data %>% 
  arrange(desc(group)) %>%
  mutate(prop = value / sum(data$value) *100) 
data
```

```
  group value      prop
1     E     2  3.846154
2     D    21 40.384615
3     C     9 17.307692
4     B     7 13.461538
5     A    13 25.000000
```
]

---
### Pie Chart vs Bar Chart

.pull-left[

```r
# Basic barchart
ggplot(data, aes(x="", y=prop, 
                 fill=group)) +
  geom_bar(stat="identity", 
           width=1, color="white") 
```

![](lecture7_files/figure-html/unnamed-chunk-31-1.png)

]

.pull-right[

```r
# Basic piechart
ggplot(data, aes(x="", y=prop, fill=group)) +
  geom_bar(stat="identity", width=1, color="white") +
  coord_polar("y", start=0)
```

![](lecture7_files/figure-html/unnamed-chunk-32-1.png)

]

Angles are harder read accurately than aligned bars.

---

## Map projections: `coord_map()`

- This is another use of polar coordinates.

- `coord_map` projects a portion of the earth, which is approximately spherical, onto a flat 2D plane using any projection defined by the `mapproj` package

- The earth is a sphere and its locations are specified by their longitude and latitude.Plotting raw latitude and longitude as Cartesian axes is
misleading due to the spherical shape of the globe. Hence, we project data to  balances between conserving areas or angles relative to the true shape lines on the globe.

---
## Map projections (cont.)

```r
world <- map_data("world")
worldmap <- ggplot(world, aes(long, lat, group = group)) +
  geom_path() +
  scale_y_continuous(NULL, breaks = (-2:3) * 30, labels = NULL) +
  scale_x_continuous(NULL, breaks = (-4:4) * 45, labels = NULL)

w1 <- worldmap + coord_map()
# Some crazier projections
w2 <- worldmap + coord_map("ortho")
```

.pull-left[

```r
w1
```

![](lecture7_files/figure-html/unnamed-chunk-34-1.png)

]

.pull-right[

```r
w2
```

![](lecture7_files/figure-html/unnamed-chunk-35-1.png)

]

---

ASSIGNMENT

- Radar chart (star chart, spider diagram, web chart)

- Polar chart (Coxcomb plot, polar area plot)