slides - Bioinformatics Sannio

Transcript

slides - Bioinformatics Sannio
Bioinformatics
introduction
Luigi Cerulo
University of Sannio
Informatics and Biology
Why informatics is important for biology?
(love or interest?)
Erwing
Schrödinger
“
... life is the process of storage,
retrieving, and transmission of
biological information.
Central dogma of molecular biology
replication
replication
transcription
DNA
RNA
reverse
transcription
translation
Protein
Central dogma of molecular biology
replication
replication
transcription
DNA
RNA
reverse
transcription
Watson & Crick
1962 Nobel (medicine)
translation
Protein
Central dogma of molecular biology
replication
Roger D. Kornberg
2006 Nobel (chemistry)
replication
transcription
DNA
RNA
reverse
transcription
Watson & Crick
1962 Nobel (medicine)
translation
Protein
Central dogma of molecular biology
replication
Roger D. Kornberg
2006 Nobel (chemistry)
replication
transcription
DNA
translation
RNA
reverse
transcription
Watson & Crick
1962 Nobel (medicine)
Andrew Z. Fire, Craig C. Mello
2006 Nobel (medicine)
Protein
Central dogma of molecular biology
replication
Roger D. Kornberg
2006 Nobel (chemistry)
replication
transcription
DNA
V. Ramakrishnan, T. A. Steitz, A. E. Yonath
2009 Nobel (chemistry)
translation
RNA
reverse
transcription
Watson & Crick
1962 Nobel (medicine)
Andrew Z. Fire, Craig C. Mello
2006 Nobel (medicine)
Protein
Central dogma of molecular biology
replication
E.H. Blackburn, C.W. Greider, J.W.. Szostak
2009 Nobel (medicine)
Roger D. Kornberg
2006 Nobel (chemistry)
replication
transcription
DNA
V. Ramakrishnan, T. A. Steitz, A. E. Yonath
2009 Nobel (chemistry)
translation
RNA
reverse
transcription
Watson & Crick
1962 Nobel (medicine)
Andrew Z. Fire, Craig C. Mello
2006 Nobel (medicine)
Protein
Information can be:
created
transformed
destroyed! (not for matter)
L’informazione esiste in quanto supportata o
trasmessa da un supporto fisico
Information support of a cell
Computational Systems
Artificial
Natural
Human Genome
about 3,200 million of base pairs
Wellcome Collection Museum in London
Il genoma umano è lungo circa
1.1 metri
Contiene circa 3 miliardi di basi
Il tutto è impacchettato in uno
Human Genome
about 3,200 million of base pairs
Wellcome Collection Museum in London
Il genoma umano è lungo circa
1.1 metri
Contiene circa 3 miliardi di basi
Il tutto è impacchettato in uno
fits on 4
floppy disk
fits on 1
cd rom
Bioinformatics
•
Analysis
Bioinformatics (Paulien Hogeweg,
1970)
study of informatic processes in biotic systems
•
Biophysics
•
Biochemistry
You continued working
I
on topics related to molecuthe st
lar biology but how did the
your m
sequencing era affect your readays
search?
H
Hogeweg: At the bestarte
ginning of the 80s, the first
aroun
public data sets became
main
available. So we got indeed
cused
this very first set from the
pheno
EMBL and one of the very
that
first things we did was to
was s
create a multiple alignment
that
programme. It was actualcould
Hogeweg anno 1980
ly the first programme of
inform
this kind [J Mol Evol, 20:175-86]. It basiquence could not.
cally worked by generating a provisional
our hypercycle wor
phylogenetic tree, by clustering sequences
tern formation in s
based on pairwise alignments. Then, the
next step was aligning sequences progresI am aware that
sively along the tree, to obtain a multiple
portant in your res
Meaning now
•
Bioinformatics is a research discipline aimed at
resolving biological problems at molecular level by
mean of computational approaches (and statistics).
Il panda è un carnivoro
o un erbivoro?
è stato carnivoro
perchè si è scoperto
che nel panda il gene
T1R1 (responsabile
della digestione della
carne) è silente
Gli algoritmi di allineamento sono i principali strumenti forniti
dalla bioinformatica per confrontare genomi appartenenti a
specie diverse
: Biology’s Unifying Theme
ented
is a
Earth
d by
ms
as
ch
hing
time
re and
e very
ar and
on
atively
ee of
h an
her
o
and
milkare
its.
Giant panda
Spectacled bear
Ancestral
bear
Sloth bear
Sun bear
Common ancestor
of all modern bears
American black bear
Asiatic black bear
Common ancestor of
polar bear and brown bear
Polar bear
Brown bear
30
25
20
15
10
5
Millions of years ago
Figure 1.10 An evolutionary tree of bears. This tree is a hypothesis (a tentative
model) based on both the fossil record and a comparison of DNA sequences among
▲
DNA sudoku
• Su doku
• L’algoritmo per risolvere il sudoku è stato
ideato 2000 anni fa in Cina
• Recentemente è stato usato per
assemblare il DNA
Altri algoritmi
• Kabsch algorithm (Allineamento 3D di proteine)
• Baum–Welch algorithm (Risoluzione dei modelli Markoviani e ricerca
di motivi all’interno di un genoma)
• K-means clustering algorithm (Analisi di dati da microarray)
• Needleman–Wunsch algorithm (Allineamento globale di sequenze)
• Smith-Waterman algorithm (Allineamento locale di sequenze)
• Robinson-Foulds algorithm (Distanza tra genomi per ricostruire gli
alberi filogenetici)
• Ukkonen's algorithm (Alberi di suffisso per la ricerca veloce di
sequenze all’interno di un database)
A huge amount of bioinformatics tools?
A huge amount of bioinformatics tools?
Need we others?
A huge amount of bioinformatics tools?
?
?
?
Yes, because
technology evolves
new biological problems
…
Demand for Bioinformaticians profiles
•
Basic
just adopts available tools
•
Intermediate
develops simple
bioinformatics pipelines
•
Advanced
develops new tools
Experiments, in vivo, in vitro, ... in silico
The paradigm shift
The Human
genome
completed
Mendel
establish the
laws of
inheritance
1859
C. Darwin
publish
“Origin of
Specie”
1865
1953
J. Watson &
F. Crick
determine
the structure
of DNA
double helix
2001
The Scientific method
New Problem
Hypothesis/Theory
prediction
proof
deductive
reasoning
experimental
observations
inductive
reasoning
Research in biology (some years ago...)
New Problem
Hypothesis/Theory
prediction
proof
deductive
reasoning
experimental
observations
inductive
reasoning
Research in biology (Now...)
New Problem
Hypothesis/Theory
prediction
proof
deductive
reasoning
experimental
observations
inductive
reasoning
1953
2015
Social impact (now)
Personalized medicine
(It is a reality now)
•
Personal Genome Project
farmacogenomica
nutrigenomica
…
Steve Jobs, co-founder of Apple
Inc., was one of the first 20 people
in the world to have his DNA
sequenced, for which he paid
$100,000.
Social impact (far future)
•
Synthetic life
•
Genome manipulation
•
…