An elegant web tool developed at St. Jude makes it easy for any scientist to explore cancer genome data.
Scientists love data. But even the bravest can feel daunted when faced with billions of pieces of it. And what good is big data if nobody uses it?
This question is becoming increasingly important in childhood cancer research. As technologies like genome sequencing move into the clinic, avalanches of data are emerging about DNA changes that occur in childhood cancers. Now, the challenge is to get scientists excited about sifting through all that data to make new discoveries and advance cures.
Jinghui Zhang, PhD, has a simple solution: make it easy, and make it fun.
“How do you make using the data an enjoyable experience, rather than having to fight and struggle with the tools to make them work?” she asks. As chair of St. Jude Computational Biology, Zhang is an expert at analyzing big data to make big discoveries about childhood cancers.
With this goal in mind, she and her research team set about revolutionizing how scientists everywhere access and explore pediatric cancer data. The result is an elegant new web application called ProteinPaint.
A luxury vehicle—with an incredible engine
ProteinPaint is like a sleek luxury car, beautiful to look at and a pleasure to drive. With a couple of clicks on clean, simple visuals, a scientist can be drawn into deep data about a particular childhood cancer and its most common genetic alterations.
“You don’t need to learn anything first—you can go directly and use it, and the interface is intuitive,” Zhang notes. “During the exploration, users can gain knowledge about the complexity by themselves—it is a visual-based navigation process that is intuitive to human nature. That’s what we tried to capture.”
But it’s not just the sleek exterior that makes the tool stand out. It’s what’s under the hood.
ProteinPaint is powered by the largest pediatric cancer database in the world, a data portal developed at St. Jude called PeCan. Through this incredible engine, ProteinPaint delivers information on nearly 27,500 genetic alterations from more than 1,000 pediatric cancer patients.
Adult cancer data is available too, and can be compared with pediatric data in a single click. Researchers can also upload and explore their own data sets using the tool.
A global resource for discovery
With ProteinPaint, Zhang hopes to empower more researchers to take the critical next steps: use genomic data to make more accurate diagnoses, learn how different DNA changes contribute to cancer, and develop precision therapies tailored to the genetic makeup of a patient’s cancer.
“We want this to be the definitive resource for genomic information for the pediatric cancer community,” Zhang says. “Collectively we can better understand, using our combined knowledge, what contributes to cancer.”
Try it out: http://pecan.stjude.org
From Promise, Spring 2016