ALLEN BRAIN ATLAS API
The primary data of the Allen Mouse Brain Connectivity Atlas consists of high-resolution images of axonal projections targeting different anatomic regions or various cell types using Cre-dependent specimens. Each data set is processed through an informatics data analysis pipeline to obtain spatially mapped quantified projection information.
From the API, you can:
Download quantified projection values by structure
Download quantified projection values as 3-D grids
Query the source, target, spatial and correlative search services
Query the image synchronization service
Download atlas images, drawings and structure ontology
This document provides a brief overview of the data, database organization and example queries. API database object names are in camel case. See the main API documentation for more information on data models and query syntax.
Experimental Overview and Metadata
Experimental data from the Atlas is associated with the "Mouse Connectivity Projection" Product.
Each Specimen is injected with a viral tracer that labels axons by expressing a fluorescent protein. For each experiment, the injection site is analyzed and assigned a primary injection structure and, if applicable, a list of secondary injection structures.
Labeled axons are visualized using serial two-photon tomography. A typical SectionDataSet consists of 140 coronal images at 100 µm sampling density. Each image has 0.35 µm pixel resolution and raw data is in 16-bit per channel format. Background fluorescence in the red channel illustrates basic anatomy and structures of the brain, and the injection site and projections are shown in the green channel. No data was collected in the blue channel.
From the API, detailed information about SectionDataSets, SectionImages, Injections and TransgenicLines can be obtained using RMA queries.
- All experiments in the "Mouse Connectivity Projection" Product
- All experiments with an injection in the primary visual area (VISp, structure_id=385)
- Detailed metadata for one experiment with injection in the VISp (id=126862385) (http://api.brain-map.org/api/v2/data/query.xml?criteria=model::SectionDataSet,rma::criteria,[id$eq126862385],rma::include,specimen\(stereotaxic_injections\(primary_injection_structure,structures,stereotaxic_injection_coordinates\)\),equalization,sub_images,rma::options[order$eq'sub_images.section_number$asc'])
Figure: Projection dataset (id=126862385) with injection in the primary visual area (VISp) as visualized in the web application image viewer.
To provide a uniform look over all experiments, default window and level values were computed using intensity histograms. For each experiment, the upper threshold defaults to (2.33 x the 95th percentile value) for the red channel and (6.33 x the 95th percentile value) for the green channel. The default threshold can be used to download images and/or image region in 8-bit per channel image format.
In the web application, images from the experiment are visualized in an experimental detail page. All displayed information, images and structural projection values are also available through the API.
Figure:Experiment detail page for an injection into the primary visual area.
image download page to learn how to download images at different resolutions and regions of interest.See the
- RMA query to fetch meta-information of one projection image
- Download image downsampled by factor of 6 using default thresholds
- Download a region of interest at full resolution using default thresholds
Informatics Data Processing
The informatics data processing pipeline produces results that enable navigation, analysis and visualization of the data. The pipeline consists of the following components:
- an annotated 3-D reference space,
- an alignment module,
- a projection detection module,
- a projection gridding module, and
- a structure unionizer module.
The output of the pipeline is quantified projection values at a grid voxel level and at a structure level according to the integrated reference atlas ontology. The grid level data are used downstream to provide a correlative search service and to support visualization of spatial relationships. See the informatics processing white paper for more details.
3-D Reference Models
The cornerstone of the automated pipeline is an annotated 3-D reference space. For this purpose, a next generation of the common coordinate framework (CCF v3) is being created based on an average population of 1675 specimens. See the Allen Mouse Common Coordinate Framework whitepaper for detailed construction information. In this current release, the framework consists of 178 newly drawn structures spanning approximately half the brain. To support whole brain quantification, structures which have not yet been drawn are extracted and merged from the version 2 framework based on the Allen Reference Atlas. The interface between old and new structures were manually inspected and filled to create smooth transitions to create a complete brain map (~700 structures) for quantification.
Figure: The next generation Allen Mouse Common Coordinate Framework is based on shape and intensity average of 1675 specimens from the Allen Mouse Brain Connectivity Atlas. At the time of the May 2015, 178 structures on have been delineated on the anatomical template.
Structures in the common coordinate framework are arranged in a hierarchical organization. Each structure has one parent and denotes a "part-of" relationship. Structures are assigned a color to visually emphasize their hierarchical positions in the brain.
atlas drawings and ontologies page for more information.See the
All SectionDataSets are registered to ReferenceSpace id = 9 in PIR orientation (+x = posterior, +y = inferior, +z = right).
Figure: The common reference space is in PIR orientation where x axis = Anterior-to-Posterior, y axis = Superior-to-Inferior and z axis = Left-to-Right.
Volumetric data files available download:
USHORT(16 bit) anatomical template of CCFv3 - a shape and intensity average of 1675 specimen brains
FLOAT(32 bit) reconstructed Allen Reference Atlas Nissl deformably registered to the anatomical template of CCFv3
UINT (32bit) structure gray matter and fiber tract annotation of CCFv3
UINT (32bit) structure annotation extracted from the coronal Allen Reference Atlas and deformably registered to CCFv3
UINT (32bit) structure annotation extracted from the P56 Allen Developing Mouse Brain Reference Atlas and deformably registered to CCFv3
Each file is available in multiple voxel resolutions:
All volumetric data is compressed NRRD (Nearly Raw Raster Data) format. The raw numerical data is stored as a 1-D array raster as shown in the figure below.
Figure: Packing of 3-D volumetric data into a 1-D numerical array raster.
Example Matlab code snippet to read in the 25µm atlas and annotation volumes:
Example Matlab code snippet to read in the 100µm grid annotation volume:
The aim of image alignment is to establish a mapping from each SectionImage to the 3-D reference space. The module reconstructs a 3-D Specimen volume from its constituent SectionImages and registers the volume to the 3-D reference model by maximizing mutual information between the red channel of the experimental data and the average template.
Once registration is achieved, information from the 3-D reference model can be transferred to the reconstructed Specimen and vice versa. The resulting transform information is stored in the database. Each SectionImage has an Alignment2d object that represents the 2-D affine transform between an image pixel position and a location in the Specimen volume. Each SectionDataSet has an Alignment3d object that represents the 3-D affine transform between a location in the Specimen volume and a point in the 3-D reference model. Spatial correspondence between any two SectionDataSets from different Specimens can be established by composing these transforms.
"Image Sync" API methods is available to find corresponding positions between SectionDataSets, the 3-D reference model and structures. Note that all locations on SectionImages are reported in pixel coordinates and all locations in 3-D ReferenceSpaces are reported in microns. These methods are used by the Web application to provide the image synchronization feature in the multiple image viewer (see Figure).For convenience, a set of
- Sync a VISp and VISal experiment to a location in a SCs SectionDataSet
- Sync the P56 coronal reference atlas to a location in the SCs SectionDataSet
Figure: Point-based image synchronization. Multiple image-series in the Zoom-and-Pan (Zap) viewer can be synchronized to the same approximate location. Before and after synchronization screenshots show projection data with injection in the superior colliculus (SCs), primary visual area (VISp) anteolateral visual area (VISal), and the relevant coronal plates of the Allen Reference Atlas. All experiments show strong signal in the thalamus.
Projection Data Segmentation
For every Projection image, a grayscale mask is generated that identifies pixels corresponding to labeled axon trajectories. The segmentation algorithm is based on image edge/line detection and morphological filtering.
Figure: Signal detection for projection data with injection in the primary motor area. Screenshot of a segmentation mask showing detected signal in the ventral posterolateral nucleus of the thalamus (VPL), internal capsule (int), caudoputamen (CP) and supplemental somatosensory area (SSs). In the Web application, the mask is color-coded for display: green indicates a pixel is part of an edge-like object while yellow indicates pixels that are part of a more diffuse region.
Reference-aligned Image Channel Volumes
The red, green, and blue channels have been aligned to the 25um adult mouse brain reference space volume. These volumes have been stored in the API WellKnownFile table with type name "ImagesResampledTo25MicronARA". To retrieve the download link for a specific data set, query for WellKnownFiles of the appropriate type with an "attachable_id" equal to the data set id:
Download this by attaching the value of the download-link field to the API web host name (http://api.brain-map.org/api/v2/well_known_file_download/269830017). The download file will be a .zip file containing three images stored in the raw meta image format:
- resampled_red.mhd/raw: red background fluorescence
- resampled_green.mhd/raw: rAAV signal
- resampled_blue.mhd/raw: blue background fluorescence
All volumes have the same dimensions as the 25um adult mouse reference space volume.
Projection Data Gridding
For each dataset, the gridding module creates a low resolution 3-D summary of the labeled axonal trajectories and resamples the data to the common coordinate space of the 3-D reference model. Casting all data into a canonical space allows for easy cross-comparison between datasets. The projection data grids can also be viewed directly as 3-D volumes or used for analysis (i.e. target, spatial and correlative searches).
Each image in a dataset is divided into a 100 x 100 µm grid. Pixel-based statistics are computed using information from the primary image and the segmentation mask:
- projection density = sum of detected pixels / sum of all pixels in division
- projection intensity = sum of detected pixel intensity / sum of detected pixels
- projection energy = projection intensity * projection density
The resulting 3-D grid is then transformed into the standard reference space.
3-D Grid Data Service. The service returns a zip file containing the volumetric data for density, intensity and/or energy in an uncompressed format with a simple text header file in MetaImage format. Structural annotation for each grid voxel can be obtained via the ReferenceSpace gridAnnotation volume file at 100 µm grid resolution.Grid data can be downloaded for each SectionDataSet using the
Voxels with no data are assigned a value of "-1".
Example Matlab code snippet to read in the 100 µm density grid volume:
Comparing Projection Data Grids and Gene Expression Grids
Due to section sampling density, projection data grids are at 100µm resolution while gene expression grids are at 200µm resolution. Upsampling with appropriate interpolation of the gene expression data is necessary in order to numerically compare between the two different types of data. When interpolating the data, "no data" (-1) voxels needs to be handled specifically.
Example Matlab code snippet to upsample gene expression grid with "no data" handling:
Figure: ISH SectionDataSet (id=73636089) for gene Rasd2 showing enriched expression in the striatum (left). Projection SectionDataSet (id=73636089) with injection in the anterior part of the basolateral amygdalar nucleus (BLAa) showing projection to the striatum and other brain areas (center). One coronal slice of the BLAa projection density grid (green) merged with an upsampled and interpolated Rasd2 expression density grid (red).
Projection Structure Unionization
Projection signal statistics can be computed for each structure delineated in the reference atlas by combining or unionizing grid voxels with the same 3-D structural label. While the reference atlas is typically annotated at the lowest level of the ontology tree, statistics at upper level structures can be obtained by combining measurements of the hierarchical children to obtain statistics for the parent structure. The unionization process also separates out the left versus right hemisphere contributions as well as the injection versus non-injection components.
RMA. ProjectionStructureUnionize data is used in the web application to display projection summary bar graphs.Projection statistics are encapsulated as a ProjectionStructureUnionize object associated with one Structure, either left, right or both Hemispheres and one SectionDataSet. ProjectionStructureUnionize can be downloaded via
- Download structure projection signal statistics for one VISp injection experiment exclusive of injection area
- Download injection site statistics for the same experiment
- See the experiment detail page for the same experiment in the web application
- Take a look at the Cortico-thalamic Projection example application
Projection Grid Search Service
A projection grid service has been implemented to allow users to instantly search over the whole dataset to find experiments with specific projection profiles.
- The Source Search function retrieves experiments by anatomical location of the injection site.
- The Target Search function returns a rank list of experiments by signal volume in the user specified target structure(s).
- The Spatial Search function returns a rank list of experiments by density of signal in the user specified target voxel location.
- The Injection Coordinate Search function returns a rank list of experiments by distance of their injection site to a user specified seed location.
- The Correlation Search function enables the user to find experiments that have a similar spatial projection profile to a seed experiment when compared over a user-specified domain.
The projection grid search service is available through both the Web application and API.
To perform a Source Search, a user specifies a set of source structures. The service returns all experiments for which either the primary injection structure or one of its secondary injection structures corresponding to one of the specified source structures or their descendents in the ontology. The search results can also be filtered by a list of transgenic lines.
See the connected service page for definitions of service::mouse_connectivity_injection_structure parameters.
The output of the source search is a xml list of objects. Each object represents one experiment and contains information about the experiment including its unique identifier, the primary injection structure, list of any secondary injection structures, injection coordinates, injection volume and transgenic line name.
- Source search for experiments with injection in the isocortex
- Source search for experiments performed on wild-type specimens and with injection in the isocortex
- Source search for experiments performed on Syt6-Cre_KI148 cre-line specimens and with injection in the isocortex
Figure: Screenshot of source search results in the web application for experiments with injection in the isocortex. The injection location of each experiment is shown as a sphere on the 3D injection map.
To perform a Target Search, the user specifies a set of target structures. The service returns a rank list of experiments by signal volume in the target structures which are above a minimum threshold. The target structure specification can be further refined by hemisphere. The search results can also be filtered by a list of source structures and/or list of transgenic lines.
See the connected service page for definitions of service::mouse_connectivity_injection_structure parameters.
The output of the target search is a xml list of objects. Each object represents one experiment and contains information about the experiment including its unique identifier, the primary injection structure, list of any secondary injection structures, injection coordinates, injection volume and transgenic line name. Additionally, the total signal volume and number of voxels spanned by the target structure(s) is also reported.
- Target search for experiments with projection signal in the target structure LGd (dorsal part of the lateral geniculate complex) and injection in the isocortex
Figure: Screenshot of target search results in the web application for experiments with projection in target structure LGd (dorsal part of the lateral geniculate complex) and injection in the isocortex. The injection location of each experiment is shown as a sphere on the 3D injection map.
To perform a Spatial Search, a user selects a target location within the 3D reference space. The service returns a rank list of experiments by signal density in the target location and with density greater than 0.1.
See the connected service page for definitions of service::mouse_connectivity_target_spatial parameters.
The output of the target search is a xml list of objects. Each object represents one experiment and contains information about the experiment including its unique identifier, the primary injection structure, list of any secondary injection structures, injection coordinates, injection volume and transgenic line name. Additionally, the path from the target location to the injection site is listed along with signal density at each node.
- Spatial search for experiments with projection signal in a target location in VM (ventral medial nucleus of the thalamus)
Figure: Screenshot of spatial search results in the web application for experiments with projection in target location within VM (ventral medial nucleus of the thalamus). Each line in the 3D map is the computationally generated path from the target location to injection of one experiment.
Injection Coordinate Search
To perform an Injection Coordinate Search, a user specifies a seed location within the 3D reference space. The service returns a rank list of experiments by distance of its injection site to the specified seed location.
See the connected service page for definitions of service::mouse_connectivity_injection_coordinate parameters.
The output of the injection coordinate search is a xml list of objects. Each object represents one experiment and contains information about the experiment including its unique identifier, the primary injection structure, list of any secondary injection structures, injection coordinates, injection volume and transgenic line name. Additionally, distance between the injection site and seed location is also reported.
Example: Injection coordinate search for experiments with a seed location in VM (ventral medial nucleus of the thalamus)
To perform a Correlation Search, the user selects a seed experiment and a domain over which the similarity comparison is to be made. All voxels belonging to any of the domain structures form the domain voxel set. Pearson's correlation coefficient is computed between the domain voxel set from the seed experiment and every other experiment in the product. The return list is sorted by descending correlation coefficient.
See the connected service page for definitions of service::service::mouse_connectivity_correlation parameters.
The output of the injection coordinate search is a xml list of objects. Each object represents one experiment and contains information about the experiment including its unique identifier, the primary injection structure, list of any secondary injection structures, injection coordinates, injection volume and transgenic line name. Additionally, the Pearson's correlation coefficient between the experiment and the seed is reported.
- Correlation search for experiment with similar projection profile in the thalamus compared to seed experiment 112670853 (injection in primary motor area of the cortex)
Figure: Screenshot of top returns of a correlation search for experiments with similar projection to a MOp injection experiment (top-left) within the thalamus