This is a quick-start tutorial for
Cytoscape 3.
Cytoscape is an open source, cross-
platform bioinformatics program written
in Java. It is used to visualize molecular
interaction networks and integrate these
with expression profiles and other high
throughput data sets. Additional
functionality can be added by installing
plugins called "apps". We won't be
discussing apps any further in this video
but we wanted you to know that they
are available. For this tutorial, we'll be
using Cytoscape 3.6.1. If you're using
a different version of Cytoscape, some
options may have been changed or moved.
If any features discussed in this video
are different, please refer to the
documentation at manual.cytoscape.org.
We'll be following along with the
Basic Expression Analysis Tutorial in the
documentation. There is a more
in-depth version of this tutorial there
if you need additional guidance.
The original tutorial is covered by the
Creative Commons Zero v1.0
Universal license.
Let's jump right in!
Our first step will be to import a network
from the sample data provided by
Cytoscape. Click on the "Import Network
from File" button in the Tool Bar.
Navigate to the installation directory
and find the folder called "sampleData".
Inside that folder, let's open
"galFiltered.csv". Cytoscape is pretty
smart about choosing default settings
as long as your data is in a reasonable
format, so we're going to accept the
settings it chose for us. Just know that
there are a lot of options when you're
importing data for choosing the type of
data each column contains. Click "OK".
Cytoscape automatically tries to find a
good network structure for your data
after you first import it. We can play with
the layout later, but first let's get a view
of the entire network. Click the "Zoom
out to display all of current network"
button in the toolbar. The edges may be
gone in your network when you do this,
but you can get those back by clicking
"View > Show Graphics Details". This
option does use a bit more resources
when there is a lot of nodes in your
network, so it may be best to leave it off
when viewing large networks. The details
will reappear when you zoom in.
Now let's import our expression data.
Click on "Import Table from File" in the
toolbar, then choose "galExpData.csv".
Cytoscape should choose the proper default
settings, however you may need to choose
the proper key column to match with your
network depending on how your data is
structured.
If the import succeded, you should see the
new information in the Node Table. Let's
limit the columns shown by clicking on the
"Show Columns" button in the Table Panel.
Deselect the gal1RGsig, gal4RGsig, and
gal80Rsig columns.
The "shared name" contains the Ensembl ID,
but let's change the node labels to HUGO
Gene Nomenclature symbols contained in the
"COMMON" column.
Click on the "Style" tab. If you click on
"default", you'll see there are a lot of
different styles included with Cytoscape.
The button with 3 horizontal bars next to
the style selection dropdown has options
for managing the styles such as creating a
new style, copying the current style, and
renaming or removing the style.
Click on the second column of the "Label"
option. Change the column to "COMMON" and
you'll see the node labels change.
Now we want to see the expression values
shown on the network. Click on the second
column for the "Fill Color" option. For
the column, choose "gal80Rexp". For the
mapping type, let's choose "Continuous
Mapping".
Double click on the gradient to bring up
the "Continuous Mapping Editor". You can
delete individual gradient handles by
single clicking on them then hitting
"Delete". We need at least two handles to
have a gradient though, otherwise you'll
end up with a binary scale.
Click the "Add" button to add another
gradient handle.
Click and drag a handle to move it,
or single click it and edit the value in
the bottom-left.
The scale should contain your
largest and smallest values for your data,
but if it doesn't you can click on the
"Set Min and Max..." button, then click
"Reset" and "OK".
Let's create a blue-white-yellow gradient.
First, make sure there are three handles
at the top.
Drag the first handle or edit it to
approximately -1.2.
Double-click the handle
or single-click it and click the
"Node Fill Color" button,
then pick a color in the blue range.
Set the second handle to
approximately 0.5 and change the
color to white.
For the third handle,
change it to about 2.5 and yellow.
Double-click on the maximum handle and set
it to green,
then set the minimum handle to black.
Note that the default value is set to
light blue. A good trick is to set it to a
color outside the spectrum you're using to
see nodes without a value.
The columns with "sig" at the end contain
significance values. We can use those to
change the node shape so that nodes with a
significance below a set threshold can have
one shape, while those above it
can have another.
Click on the second column in "Shape",
then set the Column to "gal80Rsig" and the
mapping to "Continuous". Double-click on
the "Current Mapping" box, then click the
"Add" button in the bottom right.
Double-click on the left circle and change
it to a rectangle.
Click the handle between the shapes
and change the value to 0.05.
The "shared interaction" column in the
Edge Table uses "pp" for protein-protein
interactions, and "pd" for protein-DNA
interactions. Let's filter out the
protein-protein interactions.
Click on the "Select" tab in the Control
Panel. Click the "+" icon, then choose
"Column Filter". Choose "Edge: interaction"
and leave the next option as "contains".
Type "pp" into the text search box. The
filter should automatically be applied.
The "Apply when filter changes" option is
usually selected by default. For very
large networks, it is recommended to
disable this option, change all of the
filters at once, then click the "Apply"
button.
You should now see many edges in the
network selected. Let's remove the
protein-protein edges we just chose.
Click "Edit", then "Delete Selected Nodes
and Edges". Now there will be many
unconnected nodes. Let's clean it up by
changing the layout.
Click "Layout", then "Settings". You can
try different layouts and options here,
but we're interested in the "Preferred
Layout" tab. Make sure it is set to
"Prefuse Force Directed Layout".
Go back to "Layout", then select
"Apply Preferred Layout".
If we zoom in we can see that there are
three highly expressed nodes connected by
two intermediate nodes. Let's create a new
network from those two nodes.
Click on the first node, then hold control
and click on the second node.
Then click "Select > Nodes >
First Neighbors of Selected Nodes >
Undirected". Finally, click on "New
Network from Selection" in the toolbar.
You'll notice in the Network Panel that
this has created a new child network.
If our data contains identifiers used by
external databases, Cytoscape can
automatically link us to them. Right click
on the "GAL4" node, then choose
"External Links > Sequences and Proteins >
Entrez Gene". This will open your default
browser and perform a search using the
"name" identifier.
Cytoscape also provides the option to draw
charts and graphs on each node.
Let's remove the old fill color mappings
by right-clicking on the fill mapping
gradient and choosing "Edit > Remove
Mappings from Selected Visual Properties".
Change the default value back to gray.
Choose "Properties" at the top of the
panel, then choose
"Paint > Custom Paint 1 > Image/Chart 1".
Select the "Default" box, which will bring
up the "Graphics" dialogue.
Choose the "Charts tab" and move all of
the expression columns to the right side.
Change the type of graph to "Heat
Strips" and hit apply.
Thank you for watching this Quickstart
Tutorial for Cytoscape 3.
For more Cytoscape Tutorials, visit
tutorials.cytoscape.org
or visit my YouTube channel.