AbstractBiomarkers provide great potential for understanding and predicting patient outcomes. However, the large numbers of putative biomarkers assayed by many discovery technologies can be difficult to parse quickly to understand their potential as predictors of an outcome of interest. viSNE is a non-linear dimensionality reduction algorithm available on the Cytobank informatics platform that can rapidly assess whether a set of biomarkers can meaningfully group samples. We establish a method for applying viSNE to RNA-seq data. We demonstrate this method using data from The Cancer Genome Atlas for 1,011 samples from 5 different types of kidney cancer and matched normal tissue samples for 139 of the subjects. The tumors ranged from stage I to stage IV, and subjects varied by age, gender, and whether they were living when the sample was taken. Using FPKM data from the 806 most variable transcripts in these samples, viSNE was able to group the samples into six distinct “islands” that separated cancer samples from each of the five disease types and the matched normal tissue. We demonstrate that viSNE is robust to noise and is able to group the samples even when variables that do not contribute to the signal are present in the data, and illustrate how this can be leveraged as part of a larger biomarker discovery workflow with RNA-seq and other data types. viSNE allows rapid exploratory data analysis that helps to build understanding of correlations between biomarkers and patient outcomes and improve overall time-to-results in biomarker discovery pipelines.Citation Format: Ashu Sethi, Hannah Polikowsky, Katherine A. Drake. viSNE in Cytobank enables rapid exploratory data analysis for RNA-seq biomarker discovery [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2018; 2018 Apr 14-18; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2018;78(13 Suppl):Abstract nr 2265.