My first steps using the gganimate package using Game of Thrones data!
Why use GOT data? Because I was participating in #datosdemiercoles which is the spanish version of #tidytuesday. So the data is given and the purpouse is to learn new packages using that data and share with the community, you know this already, right?
Secondly the package I want to learn beside {ggforce} is {gganimate} so a very first idea was represent every character as a point and move according the actual affiliations. A kind of copy inspiration from FlowingData’s A Day in the Life of Americans1
The data come from this post where the shifting affiliations are visualized using an alluyvial diagram. It’s a nice post by Matthew Lunkes where he tell all the process to get the final chart:
In this case the data can be downloaded from this repository https://github.com/MattLunkes/GoT_Affiliations.
library(tidyverse)
data <- read_csv("https://raw.githubusercontent.com/MattLunkes/GoT_Affiliations/master/got_char.csv")
data
# A tibble: 488 × 11
Name Origin `Starting Affil… `End of S1` `End of S2` `End of S3`
<chr> <chr> <chr> <chr> <chr> <chr>
1 Tyrio… House … King Robert Bar… King Joffr… King Joffr… King Joffr…
2 Cerse… House … King Robert Bar… King Joffr… King Joffr… King Joffr…
3 Daene… House … Viserys Targary… Daenerys T… Daenerys T… Daenerys T…
4 Jon S… House … King Robert Bar… Night's Wa… Wildlings Night's Wa…
5 Sansa… House … King Robert Bar… King Joffr… King Joffr… King Joffr…
6 Arya … House … King Robert Bar… Other, Wes… Other, Wes… Other, Wes…
7 Jaime… House … King Robert Bar… King Joffr… King Joffr… King Joffr…
8 Jorah… House … Viserys Targary… Daenerys T… Daenerys T… Daenerys T…
9 Theon… House … King Robert Bar… Robb Stark… Balon Grey… King Joffr…
10 Samwe… House … Night's Watch Night's Wa… Night's Wa… Night's Wa…
# … with 478 more rows, and 5 more variables: End of S4 <chr>,
# End of S5 <chr>, End of S6 <chr>, End of S7 <chr>, Episodes <dbl>
As we see, the data comes in a not tidy way so gather
is our friend here.
data_long <- data %>%
janitor::clean_names() %>%
rename(end_of_s0 = starting_affiliation) %>%
select(-episodes, -origin) %>%
gather(season, affiliation, -name) %>%
mutate(
season = as.numeric(str_extract(season, "\\d+")),
affiliation = case_when(
affiliation == "King Robert Baratheon" ~ "Baratheon",
affiliation == "Viserys Targaryen" ~ "Targaryen",
affiliation == "King Joffrey Baratheon" ~ "Lannister",
affiliation == "Daenerys Targaryen" ~ "Targaryen",
affiliation == "Night's Watch" ~ "Night's Watch",
affiliation == "Other, Westeros" ~ "Westeros",
affiliation == "Wildlings" ~ "Wildlings",
affiliation == "King Tommen Baratheon" ~ "Lannister",
affiliation == "Petyr Baelish, Lord Protector of the Vale" ~ "The Vale",
affiliation == "Other, Essos" ~ "Essos",
affiliation == "Roose Bolton, Lord Paramount of the North" ~ "Bolton",
affiliation == "Queen Cersei Lannister" ~ "Lannister",
affiliation == "Jon Snow, King in the North" ~ "Stark",
TRUE ~ affiliation
)
)
# there are some repeated characters?
data_long <- data_long %>%
dplyr::semi_join(count(data, Name) %>% filter(n == 1), by = c("name" = "Name")) %>%
# importante for the ggrepel part
arrange(season, name, affiliation)
data_long
# A tibble: 3,592 × 3
name season affiliation
<chr> <dbl> <chr>
1 Addam Marbrand 0 Baratheon
2 Adrack Humble 0 Baratheon
3 Aeron Greyjoy 0 Baratheon
4 Aggo 0 Khal Drogo
5 Alliser Thorne 0 Night's Watch
6 Alton Lannister 0 Baratheon
7 Alys Karstark 0 Baratheon
8 Amory Lorch 0 Baratheon
9 Anara 0 Essos
10 Anguy 0 Baratheon
# … with 3,582 more rows
At the beginning I think use a circular layout and see what happend but the result was far for beign interesting, and as we can see I was a failure in my first attempt using {gganimate}.
Well, so the next idea and step was to get closer the affilations related. How can be two affiliations be related? An answer can be the the amount of characters which move from one to another.
ts <- data_long %>%
distinct(season) %>%
pull() %>%
head(-1)
change_season <- map_df(ts, function(t = 0){
full_join(
data_long %>% filter(season == t),
data_long %>% filter(season == t + 1),
by = "name",
suffix = c("_before", "_actual")
) %>%
count(from = affiliation_before, to = affiliation_actual) %>%
filter(complete.cases(.)) %>%
mutate(season = t)
})
change_total <- change_season %>%
group_by(from, to) %>%
summarise(n = sum(n)) %>%
ungroup()
change_total
# A tibble: 119 × 3
from to n
<chr> <chr> <int>
1 Balon Greyjoy, King of the Iron Islands Balon Greyjoy, King … 31
2 Balon Greyjoy, King of the Iron Islands Deceased 7
3 Balon Greyjoy, King of the Iron Islands Euron Greyjoy, King … 4
4 Balon Greyjoy, King of the Iron Islands Lannister 1
5 Balon Greyjoy, King of the Iron Islands Targaryen 3
6 Baratheon Balon Greyjoy, King … 9
7 Baratheon Brotherhood Without … 5
8 Baratheon Deceased 15
9 Baratheon Essos 1
10 Baratheon House Arryn (Neutral) 12
# … with 109 more rows
Now, with this data we can use the {igraph} package and the graph_from_data_frame
function to get a graph from the previous data frame and then get a layout.
Nice! but we need the positions instead of the image. So we’ll use the layout_with_fr
to get some layout of our graph.
set.seed(123)
layout <- layout_with_fr(g)
affiliations <- tibble(
affiliation = V(g)$name,
x = layout[, 2],
y = layout[, 1],
degree = degree(g)
)
affiliations <- data_long %>%
count(affiliation) %>%
left_join(affiliations, ., by = "affiliation")
affiliations
# A tibble: 27 × 5
affiliation x y degree n
<chr> <dbl> <dbl> <dbl> <int>
1 Balon Greyjoy, King of the Iron Islands -1.28 0.105 9 46
2 Baratheon -0.695 0.361 11 200
3 Bolton -0.205 0.388 5 18
4 Brotherhood Without Banners 0.0145 0.743 6 52
5 Deceased -0.421 -0.360 26 826
6 Dothraki -1.09 -1.49 6 100
7 Essos -0.229 -0.782 16 262
8 Essos Slavers -0.200 -1.44 6 113
9 Euron Greyjoy, King of the Iron Islands -2.08 0.617 2 4
10 High Sparrow -1.37 -0.319 4 7
# … with 17 more rows
At this point we are ready to use ggplot and check!
p1 <- ggplot(affiliations, aes(x, y, color = affiliation, label = affiliation, size = degree)) +
geom_point() +
geom_text() +
scale_size(range = c(1, 4)) +
theme(legend.position = "none") +
labs(title = "igraph laytout")
p1
This is really an improvement from the the circular layout. The downside is the main affiliations are too close so the text is overlaping. A simple solution to this was generate an equidistant sequence for every set of coordinates, \(x\) and \(y\).
We can compare the results:
Happy with the effect of a simple fix for the overlaping text. And I think this change keep the spirit of the original graph’s shape.
To get the character positions for every step/time/season we decided to put them in the corresponding affiliation making a circle around it and then adding a random noise
get_reg_poly_coords <- function(sides = 5, radius = 1, x0 = 0, y0 = 0) {
# https://stackoverflow.com/a/7198179/829971
x <- radius * cos(2*pi*(1:sides)/sides) + x0
y <- radius * sin(2*pi*(1:sides)/sides) + y0
return(tibble(x, y))
}
characters <- data_long %>%
count(season, affiliation) %>%
mutate(coords = map2(n, 1/nrow(affiliations), get_reg_poly_coords)) %>%
unnest(cols=c(coords)) %>%
select(-season, -affiliation) %>%
bind_cols(data_long, .) %>%
left_join(affiliations, by = c("affiliation"), suffix = c(".character", ".affiliation")) %>%
mutate(
x = x.character + x.affiliation,
y = y.character + y.affiliation
) %>%
mutate_at(vars(x, y), ~ .x + runif(length(.x), -1, 1)/nrow(affiliations))
p <- ggplot() +
geom_point(aes(x, y, color = affiliation), alpha = 0.5, data = characters) +
geom_text(aes(x, y, size = n, label = affiliation), alpha = 0.5, data = affiliations) +
scale_size_area() +
scale_color_viridis_d() +
theme(legend.position = "none")
p
Nice! We are almost there.
To get a very style GOT theme we need first the font, you can download from this link https://fontmeme.com/fonts/game-of-thrones-font/.[Thanks to (violetrzn)[https://twitter.com/violetrzn], https://github.com/violetr/tidytuesday/blob/master/datosdem2.R#L8] and use it with the {extrafont} package.
We’ll select some important characters to use with {ggrepel} package:
main_characters <- data %>%
select(name = Name, Episodes) %>%
arrange(desc(Episodes)) %>%
head(5)
knitr::kable(main_characters)
name | Episodes |
---|---|
Tyrion Lannister | 61 |
Cersei Lannister | 58 |
Daenerys Targaryen | 56 |
Jon Snow | 56 |
Sansa Stark | 54 |
First we need some setup for the font and colors.
Then, the usual {ggplot2} syntax.
p <- ggplot() +
# maint characters labels
ggrepel::geom_text_repel(
# geom_text(
aes(x, y, label = name),
seed = seed,
# box.padding = .5, force = 0.25,, max.iter = 5000,
color = color1,
size = 3,
family = font2,
vjust = "inward", hjust = "inward",
data = dplyr::semi_join(characters, main_characters, by = "name")
) +
# https://stackoverflow.com/a/34398935/829971
# maint characters points
geom_point(
aes(x, y),
size = 3,
alpha = 0.50,
color = color2,
stroke = 0,
shape = 16,
data = semi_join(characters, main_characters, by = "name")
) +
# rest of points
geom_point(
aes(x, y),
size = 3,
alpha = 0.20,
color = color2,
stroke = 0,
shape = 16,
data = dplyr::anti_join(characters, main_characters, by = "name")
) +
# labels affiliations
geom_text(
aes(x, y + 3 / nrow(affiliations), label = affiliation, size = degree),
data = affiliations,
color = color1,
alpha = 0.80,
family = font
) +
scale_size(range = c(2, 5)) +
labs(
title = "#",
caption = "#DataBrain",
x = NULL,
y = NULL
) +
theme(
legend.position = "none",
panel.border = element_blank(),
panel.background = element_blank(),
panel.grid = element_blank(),
axis.text.x = element_blank(),
axis.text.y = element_blank(),
axis.ticks = element_blank(),
rect = element_rect(fill = bckground, color = bckground),
text = element_text(family = font, colour = color1, size = 15),
plot.title = element_text(family = font, colour = color1, size = 25),
plot.subtitle = element_text(family = font2, colour = color1, size = 13),
plot.caption = element_text(family = font2, colour = color1, size = 10),
)
Finally add the {gganimate} magic:
library(gifski)
library(av)
library(gganimate)
p <- p +
labs(subtitle = "Affiliation changes in season {trunc(frame_time)}") +
transition_time(season) +
shadow_wake(wake_length = 0.005, alpha = TRUE, exclude_layer = 1) +
ease_aes("exponential-in-out")
For test purposes I recommend reduce de fps
to 10, and duration
as much you can according how many frames you are using so you can to check if the output animation is what you want quickly, then for the final output use at least 30 fps to get a smooth transition.
animate(p, fps = 30, duration = 8*3, width = 1000, height = 800)
And voilà: