Please try the demo files in the sidebar (Demo File Sets
).
Introduction
The 'LandScape' visualization is frequently utilized to provide a systematic illustration of integrative data from multiple layers of batch samples, which are always compared to each other on certain attributes, such as genes and biological pathways mutated in cancers. This online 'LandScape' visualization is designed as a fixed part (histogram and gene-panels) with additional panels (e.g., age, gender, and histology). To visualize data, upload a CSV file in the required format and use sidebar options to customize the display.
LandScape Data (CSV file)
The uploaded CSV file must match the required format as specified below.
- header
The first line of the file should be a header that contains column names as keys. - rows
Each row in the file should contain data for a sample. - samples (
SampleID
column)
The first column lists sample names in each row, with the keySampleID
in the header line. (mandatory) - gene-panel (
g_
columns)
Add columns for genes that you want to display. (mandatory)- The header line keyword follows
g_[GeneName]
format, such as 'g_TP53' and 'g_PTEN'. - Typically, the content of each table cell for one gene and one sample lists the mutations types, such as 'Mis', 'InDel', and 'Gain'. Use
-
for non-mutation, andN/A
for not-available. - A cell may contain multiple mutation types (or values) using semicolon as separator, e.g.,
Mis;Loss
. - Each cell allows three unique values at maximum. For example,
Mis;Splice_Site;Mis;Loss
is OK, whileMis;Splice_Site;InDel;Mis;Gain
is not allowed because it contains 4 distinctive mutation types. - All values appeared in
g_
columns are summarized to calculate the sample frequency in each gene, which is shown by the horizontal histogram at the 'left-area' of gene-panel. - Generally, in cancer research, genes are always annotated with more information, such as their pathway, GO ontology, certain comments, and P-value or Q-value representing their mutated significance in batch samples. These information (if given) will be shown at the 'right-area' of gene-panel as tags, matrix or histogram.
- for each pathway, add a row with key
g_Pathway
as theSampleID
in the first column and specify the Pathway name (e.g., Adhesion) for each gene in other columns. UseN/A
for not-available. (displayed as tags) - for each GO ontology, add a row with key
g_GO
as theSampleID
in the first column and specify the GO name (e.g., Metabolic) for each gene in other columns. UseN/A
for not-available. (displayed as tags) - for comments, add a row with key
g_Comments_[name]
in theSampleID
column, where[name]
should be the comments name, e.g., g_Comments_COSMIC. Values for comments must be boolean (Y
orN
), numeric values (e.g., 10 or 15.3), orN/A
. (displayed as matrix) - for P-value, add a row with key
g_P_value
in theSampleID
column and P-value (e.g., 0.001 or 3E-8) for each gene in other cells. Use 'N/A' for not-available. (displayed as histogram) - for Q-value, all should be the same as P-value except that the key should be
g_Q_value
. (displayed as histogram)
- for each pathway, add a row with key
- The header line keyword follows
- histogram (
ht
columns)
It is possible to add multiple stacked histograms that show attributes' distribution at the top of the visualization.- Column names for histograms should follow the
ht_[AttributeName]
format, such asht_Missence
andht_Truncated
. - Only numeric values (e.g., 15, 23.9 or 0.327) are allowed as cell content, and it keeps up to three decimals.
- Since the whole CSV file as a complete matrix, please use
-
to fill the intersecting cells between 'histogram' columns and 'gene-information' rows (i.e., these mentioned above: pathway, GO, comments, P-value, and Q-value, if exist). - To display multiple histograms, use
ht2_[attr]
for the second one,ht3_[attr]
for the third one, and so on. To allow flexibility,ht1_
andht_
are both treated as the first one. - Basically, histogram is simply named with their NO., e.g.,
ht2_
corresponds to 'Histogram 2'. Optionally, users can assign a name to each histogram by adding a bracketed value afterht
. For example,ht2(NewName)_
will be named as 'Histogram NewName' rather than 'Histogram 2'. The cooresponding section of this histogram in the sidebar will also use this new name. - Although it's not common, user can omit
ht
columns to avoid displaying any histogram in the figure.
- Column names for histograms should follow the
- additional panels
Add columns to show additional panels (more information on samples) at the bottom of the visualization. Each panel might contain several attributes belonging to one category, e.g., panel 'Individual-Info' contains attributes like age, gender, smoking.- The column name key should follow the
[PanelName]_[AttributeName]
format, e.g.,Individual-Info_age
andIndividual-Info_gender
.
Note that please avoid using '_' in PanelNames. Do not worry about panel naming because the sidebar provides options to customize the displaying name of each panel, where users can choose their preferred name such as 'Individual_Info' or 'Individual Info'. - The cell content could be strings, numeric values, or
N/A
.- strings: It is always used for classifications, e.g., gender ('Female' and 'Male'). We accept maximum six classifications besides
N/A
. An attribute will be considered as a classifications type as long as strings is found in any cell in its column. - numeric values: It is always used for continuously distributed numeric attributes, e.g., age. The attribute will be shown in gradient color ranging from the minimum value to the maximum value. Considering that sometimes the numeric values may be used for classifications, e.g., tumor stages (1,2,3), OR, users just want to design groups corresponding to several value ranges, we provide related options in the sidebar to cover such usages. Note here that the classifications sill needs to subject to the maximum six requirements.
- strings: It is always used for classifications, e.g., gender ('Female' and 'Male'). We accept maximum six classifications besides
- We have two reserved PanelName short names:
mtif
for 'Metadata' andpw
for 'Pathway'. Of course, you can just use the full original name as you wish. - By default, all panels will be displayed from top to bottom on the page in the order in which they appear in the uploaded CSV file.
- The column name key should follow the
Display Interactions
There are four types of interactions: Highlights, Tooltips, External Link and Download.
- Highlights
When the mouse moves on the LandScape figure, the column (sample) and row (gene or attribute) it points to will be highlighted. - Tooltips
Tooltips will show necessary information of object that the mouse points to.- histogram: SampleID, and value of the stacked area where the mouse points.
- gene-panel
- middle-area: SampleID, gene name, and relevant mutation types.
- left-area: Gene name, and count of relevant (mutated) samples.
- right-area: At gene-comments matrix, Gene name, comments name, and comments value. At P/Q-value histogram, Gene name, and P/Q-value.
- other-panels: SampleID, attribute name and value.
- External Link
- gene name: links to GeneCards search webpage.
- pathway: links to kegg pathway search webpage.
- GO: links to amigo search webpage.
- Download
One SVG file will be generated when the 'Download' button is clicked. Two themes are supplied: the default theme with a dark background and the light theme with white background. To use the light theme, please click the 'Light Theme' button.
Sidebar Functions
The sidebar provides diverse options to fine-tune the display, such as manage files, reset size and color, group and reorder objects, and so on.
- Files
- Manage Files: checklist of CSV files uploaded previously, delete or download the CSV files.
- Upload: upload LandScape CSV file. Note that the duplicated file name will be alerted and given a random postfix.
- Choose: choose files uploaded previously. Note that this function is ONLY available to registered users (each account has certain storage).
- File Sets: NOT available to this page.
- General
- Common: reset column width; choose color for 'N/A' (effective globally).
- Samples: reorder samples with a series of selected conditions OR manually. We prepare a list of conditions (sorted or reversed sorted), such as SampleID (ASCII), Histogram values, and attributes in additional panels. Note that the well-known 'Bisect sorting' of mutated genes in cancer research is also provided.
- Panels: reorder and rename additional panels.
- Histogram
If there are multiple histograms, they will be shown in separate sections one by one in the sidebar.- Settings: hide this histogram; reset the maximum value of the Y-axis; reset the Y-axis label text; display SampleID along the X-axis.
- Data: reorder the stacked groups; reset colors of the stacked groups.
- Gene panel
- Main: reorder genes manually; reset the colors of mutation types; group gene comments; reset gradient colors and value ranges of gene comments.
- Left: group mutation types; reset colors of mutation groups; reset maximum value of the axis; reset the axis label text.
- Right: choose to show (P/Q-value, Pathway, GO, or None); reset maximum value of P/Q-value histogram axis; reset the threshold red line of P/Q-value histogram.
- Additional panel(s)
If there are multiple additional panels, they will be shown in separate sections one by one in the sidebar.- Settings
- for classifications attribute, reorder the categories in legend (left-side), and reset colors of each classification.
- for numeric value attribute, reset gradient colors and value ranges, OR, enable groups displaying with customized value ranges.
- Reorder: reorder attributes in this additional panel.
- Settings
Manual version=1.3, written by Dr. JIA Wenlong and Mr. LI Hechen on 2020-03-11.
- Guo, G., Sun, X., Chen, C., Wu, S., Huang, P., Li, Z., ..., Gui, Y., Wang, J. and Cai, Z. (2013). Whole-genome and whole-exome sequencing of bladder cancer identifies frequent alterations in genes involved in sister chromatid cohesion and segregation. Nature Genetics, 45(12), pp.1459-1463. (PMID: 24121792, See Figure 1)
Demo File
: PMID24121792.Fig1.landscape.csv - Cancer Genome Atlas Research Network, et al. (2012). Comprehensive genomic characterization of squamous cell lung cancers. Nature, 489(7417), 519. (PMID: 22960745, See Figure 1)
Demo File
: PMID22960745.Fig1.landscape.csv - Gao, Y., Chen, Z., Li, J., Hu, X., Shi, X. J., Sun, Z., ..., Gao, S. and He, J. (2014). Genetic landscape of esophageal squamous cell carcinoma. Nature Genetics, 46(10), 1097. (PMID: 25151357, See Figure 1)
Demo File
: PMID25151357.Fig1.landscape.csv - Cancer Genome Atlas Research Network, et al. (2017). Integrated genomic and molecular characterization of cervical cancer. Nature, 543(7645), 378. (PMID: 28112728, See Figure 1)
Demo File
: PMID28112728.Fig1.landscape.csv - Lin, D. C., Meng, X., Hazawa, M., Nagata, Y., Varela, A. M., Xu, L., ..., Loh K. S. and Koeffler H. P. (2014). The genomic landscape of nasopharyngeal carcinoma. Nature Genetics, 46(8), 866. (PMID: 24952746, See Figure 1a and Figure 2)
Demo File
: PMID24952746.Fig1a_Fig2.landscape.csv - Hu, Z., Zhu, D., Wang, W., Li, W., Jia, W., Zeng, X., ..., Wang, H. and Ma, D. (2015). Genome-wide profiling of HPV integration in cervical cancer identifies clustered genomic hot spots and a potential microhomology-mediated integration mechanism. Nature Genetics, 47(2), 158. (PMID: 25581428, See Figure 2)
Demo File
: PMID25581428.Fig2.landscape.csv - Ojesina, A. I., Lichtenstein, L., Freeman, S. S., Pedamallu, C. S., Imaz-Rosshandler, I., Pugh, T. J., ..., Salvesen, H. B. and Meyerson, M. (2014). Landscape of genomic alterations in cervical carcinomas. Nature, 506(7488), 371. (PMID: 24390348, See Figure 1)
Demo File
: PMID24390348.Fig1.landscape.csv
Version
v1.1.1 (2020-05-18)
Developer
Mr. LI Hechen (GitHub)
Designer
Dr. JIA Wenlong (Scholar, ORCID, GitHub)
Updates
v1.1.1
- add manual selection of samples.
- refine highlights of column and row in Light Theme.
v1.1.0
- refine group list when reorder in histogram panel.
- apply white as default 'N/A' colour in Light Theme.
v1.0.9
- refine name setting of histogram-panel.
- refine legend and name displays of histogram-panel.
- refine options of histogram-panel in sidebar.
- fix 'Bisection' tag dragging in 'Reorder samples'.
- refine legend of the left-side histogram of gene-panel.
- fix hiding of the last histogram-panel.
- add notes on demo files when no input is uploaded.
v1.0.8
- enable mutation types gourp in left histogram of gene-panel.
- allow brackets and blank in attribute name.
- allow reorder mutation icons in left histogram legend of gene-panel.
v1.0.7
- optimize rows highlighting.
- optimize displays of the GO ontology.
- enable checking range rationality of gradient colors.
- fix the histogram options in 'Reorder samples'.
v1.0.6
- unify the display of 'N/A'.
- display Histogram name if exists.
- enable reset the start/end value of gradient displays.
- allow to hide Histogram(s).
- optimize the digital displays in tooltip of Histogram.
v1.0.5
- accept two or more Histograms.
- auto adjusts legend size.
- enables the color setting of 'N/A' globally.
- re-organize the 'General' section in the sidebar.
- allow 'N/A' in gene pathway data.
- enable downloading with grey or white background.
- optimize the alert when examine the input file data.
v1.0.4
- enable maximum clipping in 'Individuals with mutation'.
- add 'Histogram: Total' to reorder samples.
- auto-adjust the maximum value of Y-axis of Histogram.
- auto-adjust the gap size when SampleID displayed.
- reset Y-axis default label of Histogram.
- deal with gene comments as metadata method.
- accept the P-values in gene-panel.
- reorder metadata legends.
v1.0.3
- enable the color setting of mutations in gene-panel.
- enable the color setting of attributes in metadata-panel.
- enable panels setting: reorder and rename.
- enable the color setting of Histogram.
v1.0.2
- enable mutation group in 'Individuals with mutation'.
- enable to reorder groups in Histogram.
- enable to display SampleID in Histogram.
- accept the gene comments.
- accept Pathway of genes.
- rules to introduce more panels.
v1.0.1
- link gene-name to genecards webpage.
- highlight the row and column where the mouse moves.
- initial functions implemented.