21  Results of Genomic Analysis

⬜ Developing Hypotheses
⬜ Sample Collection
⬜ Outbreak Investigation
⬜ Sequencing
⬜ Bioinformatics
🟩 Molecular Epidemiology
⬜ Public Health Implementation

Timeline of Outbreak Events Open Timeline in New Window

Line list of cases likely linked to outbreak

Sample ID Collection Date Notes
Bat_Trainer_01 2031-01-13 Worked with the muubats; attended conference
Bat_Trainer_02 2031-01-13 Worked with the muubats; attended conference
Canine_Trainer_01 2031-02-01 Works closely with workers 01 & 02; interacted with bat trainer 01; works with canines
Duck_Trainer_01 2031-02-04 Works closely with worker 06; works with ducks
Duck_Trainer_02 2031-02-05 Works closely with worker 06; works with ducks
Duck_Trainer_03 2031-02-06 Works closely with duck trainers and works with ducks
Feline_Trainer_01 2031-01-21 Married to bat trainer 02; works closely with the felines
Resident_01 2031-02-10 Exposed to resident 03 that visited the Safari Park Canine areas
Resident_02 2031-02-10 Exposed to resident 03 that visited the Safari Park Canine areas
Resident_03 2031-02-11 Visited the Safari Park Canine areas
Resident_04 2031-02-11 Exposed to resident 03 that visited the Safari Park Canine areas
Resident_05 2031-02-13 Visited the Safari Park, interacted with workers and animals
Resident_06 2031-02-13 Could not reach
Resident_07 2031-02-13 Could not reach
Resident_08 2031-02-13 Visited the Safari Park, interacted with workers and animals
Resident_09 2031-02-13 Could not reach
Worker_01 2031-01-22 Works closely with bat trainer 01 and canine trainer 01
Worker_02 2031-01-22 Works closely with bat trainer 01 and canine trainer 01
Worker_03 2031-01-20 Works with the bat trainers
Worker_04 2031-01-19 Works with the bat trainers
Worker_05 2031-01-20 Works with the bat trainers
Worker_06 2031-02-08 Works with the duck trainers
Worker_07 2031-02-09 Works with the duck trainers and the ducks

Line list of cases in outbreak. The line list shows the sample IDs of the sequenced cases in the outbreak. The second column shows the collection date of the samples collected from the case in year-month-day format. The third column shows the exposure notes collected from the case interviews conducted during the outbreak investigation. Unfortunately some of the cases, such as the residents, could not be reached. Some information remains incomplete for the workers as some workers tend to work in different areas, and are involved in different responsibilities when it comes to taking care of the animals and the park.

Phylogenetic Tree of Flu-Like Illness Outbreak, Fuchsia City, Safari Zone

Open tree in new tab for full view

Phylogenetic Tree of Outbreak. This phylogenetic tree shows the evolutionary relationships between sequences from the outbreak. The tips on the tree represent individual genome sequences of the virus, which are color-coded by species in relation to the park. The x-axis represents divergence (also sometimes referred to as genetic distance, or the number of mutations from the reference genome). To give an example, the reference genome is 5000 bases long, so a divergence of 0.0006 would mean 3 mutations difference. (5000 * 0.0006).

Note that identical sequences like muubat_02, muubat_03, and muubat_04 lie on the same x-axis and are stacked vertically. Comparing the muubat sequences with the sequences of the residents, the branch length of the resident is much longer indicating that the viruses that infected the muubats are less closely related to the residents.

Exposure Map of Animals, Trainers, and Workers

Exposure Map. The exposure map uses information from the case interview to piece together how animals, trainers, and workers might be linked together in the outbreak. Lines indicate that the case was either exposed to or had an interaction with the other cases or animals.

Heatmap of SNP Distances of Sequences in Outbreak

Heatmap. The heatmap shows the pairwise distance between two sequences. The pairwise distance is given by the number of single nucleotide polymorphisms (SNPs), or single base difference between two sequences. The range of SNP differences for the sequenced cases in the outbreak is 0 to 9. A SNP distnace of 0 (darkest color) means that two sequences are identical, while a higher SNP count (lighter color) indicates greater genetic divergence. Comparing this heatmap with the phylogenetic tree helps confirm whether clusters of closely related sequences correspond to expected epidemiological links. Sequences that are darker in color indicate fewer SNP differences between the two sequences, whereas lighter color indicates more SNP differences between the two cases.

21.1 Interpreting the Results

“Let’s take some time to review the results. We should each try to come up with our own interpretations and then compare to see what seems to be best explained by the data.”

The phylogenetic tree revealed patterns in the genetic relationship between the sequences generated from each sample. Some sequences clustered more closely together whereas others showed greated divergence. Meanwhile, the SNP heatmap highlighted key genetic differences, with certain sequences appearing more similar while others displayed increasing variation over time. The team deliberated: what might these patterns suggest about how the virus spread and changed throughout the outbreak?

21.2 Discussion Twelve: Interpreting Phylogenetic Trees

A. Based on the phylogenetic tree, what can you infer about the relationships between the samples? Are there any distinct clusters or outliers?

B. Does the tree structure align with the exposure map? Are the spatial and genomic patterns consistent with one another, or do you notice any discrepancies?

C. When comparing the heatmap (pairwise SNP distances) to the tree, do any clusters or distance patterns reinforce the phylogenetic relationships you observed?

D. Taking all three data sources into account, what conclusions can you draw about transmission dynamics or potential sources? Does this analysis support or challenge your earlier hypothesis?