Overview

This vignette demonstrates how to use OME-TIFF files generated by the NanoString GeoMx instrument to enhance the data visualization for a NanoString GeoMx experiment. SpatialOmicsOverlay was specifically made to visualize and analyze the free-handed nature of Region of Interest (ROI) selection in a GeoMx experiment, and the immunofluorescent-guided segmentation process. The overlay from the instrument is recreated in the R environment allowing for plotting overlays with data like ROI type or gene expression.

In this vignette, we will be walking through a mouse brain dataset from the Spatial Organ Atlas

Package installation

Download package tarball from NanoString’s GeoScriptHub (link pending)

if (!requireNamespace("BiocManager", quietly=TRUE))
    install.packages("BiocManager")

BiocManager::install("GeomxTools")
BiocManager::install("SpatialOmicsOverlay")

Load Libraries

library(SpatialOmicsOverlay)
library(GeomxTools)

Data Ingestion

Files needed:

  1. OME-TIFF from GeoMx instrument (other OME-TIFFs should work with custom parsing functions)
  2. Lab Worksheet from instrument readout
  3. Annotation file(s)

Reading in the SpatialOverlay object can be done with or without the image. We will start without the image as that can be added later.

If outline = TRUE, only ROI outline points are saved. This decreases memory needed and figure rendering time downstream. If ANY ROIs are segmented in the study, outline will be FALSE. In this particular example, there are segmented ROIs, so we set outline = FALSE.

In this example, we are downloading a TIFF image from AWS S3 but this variable is simply the file path to a OME-TIFF.

This function will be downloading a 13 GB file and will keep a 4 GB file in BiocFileCache. This download will take ~15 minutes but you will only have to download once

tifFile <- downloadMouseBrainImage()
tifFile
## [1] "/home/biocbuild/.cache/R/SpatialOmicsOverlay/22729e44db382_mu_brain_004.ome.tiff"
muBrainLW <- system.file("extdata", "muBrain_LabWorksheet.txt",
                         package = "SpatialOmicsOverlay")

muBrain <- readSpatialOverlay(ometiff = tifFile, annots = muBrainLW, 
                              slideName = "D5761 (3)", image = FALSE,  
                              saveFile = FALSE, outline = FALSE)

The readSpatialOverlay function is a wrapper to walk through all of the necessary steps to store the OME-TIFF file components. This function automates XML extraction & parsing, image extraction, and coordinate generation. These functions can also be run separately if desired (xmlExtraction, parseScanMetadata, parseOverlayAttrs, addImageOmeTiff, createCoordFile).

If you are getting a java.lang.OutOfMemoryError: Java heap space when running readSpatialOverlay(), restart your R session and try increasing the maximum heap size by supplying the -Xmx parameter before the Java Virtual Machine is initialized used by a dependency package RBioFormats. Run the code below before loading any other libraries. RBioFormats source

options(java.parameters = "-Xmx4g")
library(RBioFormats)

SpatialOverlay Accessors

SpatialOverlay objects hold data specific to the image and the ROIs. Here are a couple of functions to access the most important parts.

#full object
muBrain
## SpatialOverlay 
## Slide Name: D5761 (3) 
## Overlay Data: 93 samples 
##    Overlay Names: DSP-1012999073013-A-A02 ... DSP-1012999073013-A-H09 ( 93 total ) 
## Scan Metadata 
##    Panels: TAP Mouse Whole Transcriptome Atlas 
##    Segmentation: Segmented 
## Outline: FALSE
#sample names
head(sampNames(muBrain))
## [1] "DSP-1012999073013-A-A02" "DSP-1012999073013-A-A03"
## [3] "DSP-1012999073013-A-A04" "DSP-1012999073013-A-A05"
## [5] "DSP-1012999073013-A-A06" "DSP-1012999073013-A-A07"
#slide name
slideName(muBrain)
## [1] "D5761 (3)"
#metadata of ROI overlays
#Height, Width, X, Y values are in pixels
head(meta(overlay(muBrain)))
ROILabel Sample_ID Height Width X Y Segmentation
001 DSP-1012999073013-A-A02 751 751 10363 15008 Geometric
002 DSP-1012999073013-A-A03 1239 782 14346 16630 Geometric
003 DSP-1012999073013-A-A04 1272 902 15664 16603 Geometric
004 DSP-1012999073013-A-A05 1119 1276 13707 16135 Geometric
005 DSP-1012999073013-A-A06 1054 1124 15932 16228 Geometric
006 DSP-1012999073013-A-A07 751 751 10756 14259 Geometric
#coordinates of each ROI
head(coords(muBrain))
sampleID ycoor xcoor
DSP-1012999073013-A-A02 15383 10363
DSP-1012999073013-A-A02 15356 10364
DSP-1012999073013-A-A02 15357 10364
DSP-1012999073013-A-A02 15358 10364
DSP-1012999073013-A-A02 15359 10364
DSP-1012999073013-A-A02 15360 10364

Plotting Without Image

After parsing, ROIs can be plotted without the image in the object. These plots are the highest resolution versions since there is no scaling down to the image size, and might take a little time to render. If the image is attached to the object, coordinates are automatically scaled down to the image size and plotted as if they are on top of the image.

While manipulating the figure, there is a low-resolution option for faster rendering times.

A scale bar is automatically calculated when plotting. This functionality can be turned off using scaleBar = FALSE. Scale bars can be fully customized using corner, textDistance, and variables that start with scaleBar: scaleBarWidth, scaleBarColor, etc.

plotSpatialOverlay(overlay = muBrain, hiRes = FALSE, legend = FALSE)

colorBy, by default, is Sample_ID but almost any annotation or data can be added instead, including gene expression, tissue morphology annotations, pathway score, etc. These annotations can come from a data.frame, matrix, GeomxSet object, or vector. Below we attach the gene expression for CALM1 from a GeomxSet object and color the segments by that value.

muBrainAnnots <- readLabWorksheet(lw = muBrainLW, slideName = "D5761 (3)")
muBrainGeomxSet <- readRDS(unzip(system.file("extdata", "muBrain_GxT.zip",
                                  package = "SpatialOmicsOverlay")))
muBrain <- addPlottingFactor(overlay = muBrain, annots = muBrainAnnots, 
                             plottingFactor = "segment")
muBrain <- addPlottingFactor(overlay = muBrain, annots = muBrainGeomxSet, 
                             plottingFactor = "Calm1")
muBrain <- addPlottingFactor(overlay = muBrain, annots = 1:length(sampNames(muBrain)), 
                             plottingFactor = "ROILabel")
muBrain
## SpatialOverlay 
## Slide Name: D5761 (3) 
## Overlay Data: 93 samples 
##    Overlay Names: DSP-1012999073013-A-A02 ... DSP-1012999073013-A-H09 ( 93 total ) 
## Scan Metadata 
##    Panels: TAP Mouse Whole Transcriptome Atlas 
##    Segmentation: Segmented 
## Plotting Factors: 
##    varLabels: segment Calm1 ROILabel 
## Outline: FALSE
head(plotFactors(muBrain))
segment Calm1 ROILabel
DSP-1012999073013-A-A02 Full ROI 1177 1
DSP-1012999073013-A-A03 Full ROI 765 2
DSP-1012999073013-A-A04 Full ROI 1045 3
DSP-1012999073013-A-A05 Full ROI 1730 4
DSP-1012999073013-A-A06 Full ROI 1119 5
DSP-1012999073013-A-A07 Full ROI 600 6

Customizing the graph

All generated figures are ggplot based so they can be easily customized using functions from that or similar grammar of graphs packages. For example, we can change the color scale to the viridis color palette.

Note: hiRes and outline figures use fill, lowRes uses color

plotSpatialOverlay(overlay = muBrain, hiRes = FALSE, colorBy = "Calm1", 
                   scaleBarWidth = 0.3, scaleBarColor = "green") +
    viridis::scale_color_viridis()+
    ggplot2::labs(title = "Calm1 Expression in Mouse Brain")

Adding the Image

Images can be added automatically using readSpatialOverlay(image = TRUE) or added after reading in the object.

An OME-TIFF file is a pyramidal file, meaning that many sizes of an image are saved. The largest having the highest resolution and decreasing as the image gets smaller. Images are 1/2 the size as the previous resolution.

Pyramidal TIFF

Pyramidal TIFF

The res variable says which resolution of the image to extract. 1 = largest image and the higher values get smaller. Each OME-TIFF has a different number of layers, with most having around 8. It is suggested to use the smallest res value, and highest resolution, your environment can handle. This is a trial and error process.

Using too big of an image will cause a java memory error. If this error occurs, increase your res value. Below is an example of the error you will receive if the resolution is too high for your system.

Error in .jcall("RBioFormats", "Ljava/lang/Object;", "readPixels", i,  : 
  java.lang.NegativeArraySizeException: -2147483648

The resolution size will affect speed and image resolution through the rest of the analysis. To check the smallest resolution size available, for the fastest speeds, use checkValidRes(). For the rest of this tutorial we will be using res = 8 for vignette size restrictions, but res 4-6 is recommended.

#lowest resolution = fastest speeds
checkValidRes(ometiff = tifFile)
## [1] 8
res <- 8
muBrain <- addImageOmeTiff(overlay = muBrain, ometiff = tifFile, res = res)
muBrain
## SpatialOverlay 
## Slide Name: D5761 (3) 
## Overlay Data: 93 samples 
##    Overlay Names: DSP-1012999073013-A-A02 ... DSP-1012999073013-A-H09 ( 93 total ) 
## Scan Metadata 
##    Panels: TAP Mouse Whole Transcriptome Atlas 
##    Segmentation: Segmented 
## Plotting Factors: 
##    varLabels: segment Calm1 ROILabel 
## Outline: FALSE 
## Image: /home/biocbuild/.cache/R/SpatialOmicsOverlay/22729e44db382_mu_brain_004.ome.tiff
showImage(muBrain)

plotSpatialOverlay(overlay = muBrain, colorBy = "segment", corner = "topcenter", 
                   scaleBarWidth = 0.5, textDistance = 130, scaleBarColor = "cyan")

Visualization Marker Legends

There are 2 ways to add a legend to the graph showing the immunofluorescent visualization markers used.

The first is an easy way for data exploration, adding a legend to the ggplot object directly by setting flourLegend = TRUE.

plotSpatialOverlay(overlay = muBrain, colorBy = "segment", corner = "topcenter", 
                   scaleBarWidth = 0.5, textDistance = 130, scaleBarColor = "cyan",
                   fluorLegend = TRUE)

The second requires more user manipulation but creates a more publication-ready figure. The flourLegend function creates a separate plot that can be added to the graph. The legend shape can be changed with nrow and the background can be changed using boxColor and alpha.

See ?draw_plot for more instructions on how to manipulate the legend position and scale.

library(cowplot)
gp <- plotSpatialOverlay(overlay = muBrain, colorBy = "segment", 
                         corner = "bottomright")
legend <- fluorLegend(muBrain, nrow = 2, textSize = 4, 
                      boxColor = "grey85", alpha = 0.3)
cowplot::ggdraw() +
    cowplot::draw_plot(gp) +
    cowplot::draw_plot(legend, scale = 0.105, x = 0.1, y = -0.25)

Image Manipulation

Flipping Axes

Images and overlays can be flipped across either axis to reorient the image. To flip both axes use flipY(flipX(overlay)). These functions update the coordinates and image rather than just affecting the figure. In this example, the original image is reversed from the traditional view of the mouse brain, so we shall flip the Y-axis.

muBrain <- flipY(muBrain)
plotSpatialOverlay(overlay = muBrain, colorBy = "segment", scaleBar = FALSE)

plotSpatialOverlay(overlay = flipX(muBrain), colorBy = "segment", scaleBar = FALSE)

Cropping

Images can be cropped 2 ways. The amount of area added to the cropped area in both methods can be defined by buffer. This adds a percentage of the final image size to each edge.

  1. cropTissue automatically detects where the tissue is and removes non-tissue area from around the tissue.
muBrain <- cropTissue(overlay = muBrain, buffer = 0.05)
plotSpatialOverlay(overlay = muBrain, colorBy = "ROILabel", legend = FALSE, scaleBar = FALSE)+
    viridis::scale_fill_viridis(option = "C")

2. cropSamples automatically crops the image around the ROIs given. Other ROIs in the cropped image can be kept in or ignored. Below we will crop to only ROIs that are unsegmented, hiding ROIs profiled which are segmented. Setting sampsOnly = TRUE hides the segmented ROIs which are within the plotted region.

samps <- muBrainAnnots$Sample_ID[muBrainAnnots$segment == "Full ROI" & 
                                     muBrainAnnots$slide.name == slideName(muBrain)]

muBrainCrop <- cropSamples(overlay = muBrain, sampleIDs = samps, sampsOnly = TRUE)
plotSpatialOverlay(overlay = muBrainCrop, colorBy = "Calm1", scaleBar = TRUE, 
                   corner = "bottomleft", textDistance = 5)+
    ggplot2::scale_fill_gradient2(low = "grey", high = "red", 
                                  mid = "yellow", midpoint = 2500)

muBrainCrop <- cropSamples(overlay = muBrain, sampleIDs = samps, sampsOnly = FALSE)
plotSpatialOverlay(overlay = muBrainCrop, colorBy = "segment", scaleBar = TRUE, 
                   corner = "bottomleft", textDistance = 5)

Image Coloring

Image colors are typically determined before downloading the OME-TIFF from the instrument but can be recolored here.

This recoloring must be done on the 4 channel image before converting to RGB. The color code and min/max intensities determine the coloring of the RGB image. To view the current color definition use the fluor function.

The color can be a hex color or a valid R color name. The dye can either come from the Dye or DisplayName columns from fluor(overlay). To change a color use the changeImageColoring function.

chan4 <- add4ChannelImage(overlay = muBrain)
fluor(chan4)
Dye DisplayName Color WaveLength Target ExposureTime MinIntensity MaxIntensity ColorCode
Alexa 488 FITC Blue 525nm GFAP 200.0 µs 50 18509 #0000feff
SYTO 83 Cy3 Green 568nm DNA 50.0 µs 4 805 #00fe00ff
Alexa 594 Texas Red Yellow 615nm Iba-1 300.0 µs 33 3174 #fefe00ff
Alexa 647 Cy5 Red 666nm NeuN 100.0 µs 88 30000 #fe0000ff
chan4 <- changeImageColoring(overlay = chan4, color = "#32a8a4", dye = "FITC")
chan4 <- changeImageColoring(overlay = chan4, color = "magenta", dye = "Alexa 647")
chan4 <- changeColoringIntensity(overlay = chan4, minInten = 500, 
                                 maxInten = 10000, dye = "Cy5")
fluor(chan4)
Dye DisplayName Color WaveLength Target ExposureTime MinIntensity MaxIntensity ColorCode
Alexa 488 FITC Lightseagreen 525nm GFAP 200.0 µs 50 18509 #32a8a4
SYTO 83 Cy3 Green 568nm DNA 50.0 µs 4 805 #00fe00ff
Alexa 594 Texas Red Yellow 615nm Iba-1 300.0 µs 33 3174 #fefe00ff
Alexa 647 Cy5 Magenta 666nm NeuN 100.0 µs 500 10000 magenta
# change 4 channel TIFF to RGB
chan4 <- recolor(chan4)
showImage(chan4)

Future Directions

In future releases of SpatialOmicsOverlay, we will be

  1. Ensuring capability with NanoString’s CosMx Spatial Molecular Imager outputs
  2. Adding ability to add graphing features on top of the image
  3. Adding image analysis capabilities
  4. Adding extraction of image data to use in ML/AI applications
sessionInfo()
## R version 4.3.1 (2023-06-16)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 22.04.3 LTS
## 
## Matrix products: default
## BLAS:   /home/biocbuild/bbs-3.18-bioc/R/lib/libRblas.so 
## LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.10.0
## 
## locale:
##  [1] LC_CTYPE=en_US.UTF-8          LC_NUMERIC=C                 
##  [3] LC_TIME=en_GB                 LC_COLLATE=C                 
##  [5] LC_MONETARY=en_US.UTF-8       LC_MESSAGES=en_US.UTF-8      
##  [7] LC_PAPER=en_US.UTF-8          LC_NAME=en_US.UTF-8          
##  [9] LC_ADDRESS=en_US.UTF-8        LC_TELEPHONE=en_US.UTF-8     
## [11] LC_MEASUREMENT=en_US.UTF-8    LC_IDENTIFICATION=en_US.UTF-8
## 
## time zone: America/New_York
## tzcode source: system (glibc)
## 
## attached base packages:
## [1] stats4    stats     graphics  grDevices utils     datasets  methods  
## [8] base     
## 
## other attached packages:
## [1] cowplot_1.1.1             GeomxTools_3.6.1         
## [3] NanoStringNCTools_1.10.0  ggplot2_3.4.4            
## [5] S4Vectors_0.40.1          Biobase_2.62.0           
## [7] BiocGenerics_0.48.0       SpatialOmicsOverlay_1.2.1
## [9] RBioFormats_1.2.0        
## 
## loaded via a namespace (and not attached):
##   [1] RColorBrewer_1.1-3      jsonlite_1.8.7          magrittr_2.0.3         
##   [4] ggbeeswarm_0.7.2        magick_2.8.1            farver_2.1.1           
##   [7] nloptr_2.0.3            rmarkdown_2.25          zlibbioc_1.48.0        
##  [10] vctrs_0.6.4             memoise_2.0.1           minqa_1.2.6            
##  [13] RCurl_1.98-1.12         base64enc_0.1-3         htmltools_0.5.6.1      
##  [16] plotrix_3.8-2           curl_5.1.0              cellranger_1.1.0       
##  [19] sass_0.4.7              parallelly_1.36.0       bslib_0.5.1            
##  [22] htmlwidgets_1.6.2       plyr_1.8.9              cachem_1.0.8           
##  [25] uuid_1.1-1              commonmark_1.9.0        lifecycle_1.0.3        
##  [28] pkgconfig_2.0.3         Matrix_1.6-1.1          R6_2.5.1               
##  [31] fastmap_1.1.1           GenomeInfoDbData_1.2.11 future_1.33.0          
##  [34] digest_0.6.33           numDeriv_2016.8-1.1     colorspace_2.1-0       
##  [37] GGally_2.1.2            reshape_0.8.9           RSQLite_2.3.1          
##  [40] labeling_0.4.3          filelock_1.0.2          progressr_0.14.0       
##  [43] fansi_1.0.5             httr_1.4.7              abind_1.4-5            
##  [46] compiler_4.3.1          withr_2.5.1             bit64_4.0.5            
##  [49] tiff_0.1-11             viridis_0.6.4           DBI_1.1.3              
##  [52] MASS_7.3-60             rjson_0.2.21            tools_4.3.1            
##  [55] vipor_0.4.5             beeswarm_0.4.0          future.apply_1.11.0    
##  [58] glue_1.6.2              nlme_3.1-163            EBImage_4.44.0         
##  [61] gridtext_0.1.5          grid_4.3.1              reshape2_1.4.4         
##  [64] generics_0.1.3          gtable_0.3.4            data.table_1.14.8      
##  [67] sp_2.1-1                xml2_1.3.5              utf8_1.2.4             
##  [70] XVector_0.42.0          markdown_1.11           pillar_1.9.0           
##  [73] stringr_1.5.0           rJava_1.0-6             splines_4.3.1          
##  [76] dplyr_1.1.3             ggtext_0.1.2            BiocFileCache_2.10.1   
##  [79] lattice_0.22-5          bit_4.0.5               tidyselect_1.2.0       
##  [82] locfit_1.5-9.8          Biostrings_2.70.1       pbapply_1.7-2          
##  [85] knitr_1.44              gridExtra_2.3           IRanges_2.36.0         
##  [88] scattermore_1.2         xfun_0.40               pheatmap_1.0.12        
##  [91] stringi_1.7.12          fftwtools_0.9-11        yaml_2.3.7             
##  [94] boot_1.3-28.1           evaluate_0.22           codetools_0.2-19       
##  [97] tibble_3.2.1            cli_3.6.1               systemfonts_1.0.5      
## [100] munsell_0.5.0           jquerylib_0.1.4         Rcpp_1.0.11            
## [103] GenomeInfoDb_1.38.0     readxl_1.4.3            globals_0.16.2         
## [106] EnvStats_2.8.1          dbplyr_2.4.0            png_0.1-8              
## [109] XML_3.99-0.14           parallel_4.3.1          blob_1.2.4             
## [112] jpeg_0.1-10             bitops_1.0-7            lme4_1.1-34            
## [115] listenv_0.9.0           viridisLite_0.4.2       ggthemes_4.2.4         
## [118] ggiraph_0.8.7           lmerTest_3.1-3          scales_1.2.1           
## [121] SeuratObject_4.1.4      purrr_1.0.2             crayon_1.5.2           
## [124] rlang_1.1.1