Manuale corso base Evidence Based Practice

Transcript

Evidence Based Practice
Gabriele Manzi
Azienda U.S.L. di Bologna
GIMBE®
The Evolution of Health Care
DOING
THINGS
CHEAPER
Efficiency
1970s
GIMBE®
DOING
THINGS
BETTER
DOING
THINGS
RIGHT
DOING
THE
RIGHT
THINGS
Effectiveness
1980s
1990s
DOING
THE
RIGHT
THINGS
RIGHT
Quality
Improvement
21st Century
Evidence Based Medicine
L’EBM costituisce un approccio alla pratica clinica
dove
le
decisioni
cliniche
risultano
dall'integrazione tra l'esperienza del medico e
l'utilizzo conscenzioso, esplitico e giudizioso delle
migliori evidenze scientifiche disponibili, mediate
dalle preferenze del paziente.
David Sackett
GIMBE®
What it is and what it isn’t
Research
evidence
Patient
preferences
Clinical
expertise
Sackett et al, BMJ 1996
GIMBE®
What it is and what it isn’t
Patients’ preferences
and actions
Research
evidence
Clinical
expertise
Clinical state and
circumstances
GIMBE®
Health Care
Resurces
Modificato da Haynes RB et al, BMJ 2002
E.B. Pharmacy
E.B. Midwifery
EBM
EBN
EBPH
Evidence Based Health Care
Clinical Governance
GIMBE®
ANATOMIA DELL’INFORMAZIONE
BIOMEDICA
RICERCA
PRIMARIA
• Studi
Osservazionali
• Studi
Sperimentali
RICERCA
SECONDARIA
FONTI
TERZIARIE
• Revisioni
Sistematiche
• Linee Guida
• Analisi
economiche
• Analisi decisionali
• Revisioni
tradizionali
• Trattati
• Editoriali
• Parere degli
esperti
Distorsione dei risultati
GIMBE®
I principi dell’evidence
based practice
GIMBE®
PREREQUISITI
9
9
9
Componente essenziale: attitudine ad
avvertire il bisogno di informazione
Componente tecnica: abilità pratiche per
effettuare
le
ricerche
e
valutare
criticamente
Giudizio clinico: pesare, valutare e
integrare le evidenze nel processo
decisionale.
Cartabellotta A. Rec Prog Med 2002
GIMBE®
Come prendiamo le decisioni nella pratica
professionale?
• Scegliere tra due interventi assistenziali (quale
medicazione posizionare),
• Decidere la
tipologia di paziente su cui
intraprendere un intervento (su quali pazienti
applico le strategie di prevenzione delle lesioni
da decubito),
• La tempistica delle prestazioni (ogni quanto
realizzare la medicazione di un CVC)
• Cosa e come comunicare con i pazienti e le loro
famiglie (comunicare il rischio di caduta),
• Come organizzare l’assistenza (modalità di
consegna)
• …
GIMBE®
Come prendiamo le decisioni nella pratica
professionale?
•
•
•
•
•
•
•
GIMBE®
Insegnamento scolastico
Tradizioni/abitudini di reparto
Esperienza
Autoreferenzialità
Prescrizioni di …
Libri, riviste, …
…
Le conseguenze di ciò
Difformità della pratica
Erogazione di prestazioni non ottimali
Scarsa crescita professionale
Esposizione dei pazienti a rischi non
necessari
Determinazione di costi non giustificati
…
GIMBE®
Un esempio:
da “Le basi scientifiche dell’assistenza infermieristica” di Paola di Giulio
Prostatecmia
radicale
Due giorni
prima
Giorno prima
Giorno
dell’intervento
GIMBE®
Clinica Urologica
Divisione Urologia
-
Dieta senza scorie
* 3 per 2 cp Humatin
* 2 buste Selg (e/o Isocolan)
Dieta
* Pranzo senza scorie Cena
liquida; 3 per 2 cp Humatin
Clisma con Neomicina 1%
Tricotomia ampia (dai capezzoli
fino a metà coscia)
Doccia con betadine
Digiuno dalla mezzanotte
Digiuno totale
Clisma ore 6.00 con neomicina 1%
Doccia con Betadine
Antibiotico profilassi
Purgante
Tricotomia xifopubica e scrotale
Digiuno dalla mezzanotte
Calze antiembolo
Doccia con betadine
…una considerazione
“Meno del 20 % di ciò che i medici fanno ogni
giorno possiede almeno uno studio clinico ben
disegnato a sostegno della sua utilità.”
Cochrane A.L. ” Effectiveness and Efficiency. Random reflection on Health.
Service” Nuffield Provincial hospital trust, London, 1972.
E quello che fanno gli infermieri?
GIMBE®
Le origini dell’evidence
based practice
GIMBE®
Archibald L. Cochrane
(1909-1988)
GIMBE®
Evidence-based Medicine: la storia 1972
Archibald L. Cochrane, un epidemiologo inglese,
sosteneva che i risultati della ricerca avevano un
impatto molto limitato sulla pratica clinica.
“E’ causa di grande preoccupazione constatare come
la professione medica non abbia saputo organizzare
un sistema in grado di rendere disponibili, e
costantemente aggiornate, delle revisioni critiche
sugli effetti dell'assistenza sanitaria".
In altre parole Cochrane, suggeriva di rendere
disponibili a tutti i pazienti solo gli interventi sanitari di
documentata efficacia.
GIMBE®
Evidence-based Medicine: la storia 1986
L'attenzione di Sackett e coll. si sposta
progressivamente da “come leggere la
letteratura biomedica” a “come utilizzare la
letteratura biomedica per risolvere i problemi
clinici”.
David Sackett
GIMBE®
Evidence Based Nursing
GIMBE®
Evidence = Evidenza ?
In Italiano
– È evidenza qualsiasi cosa che sia palese e
chiara, per la quale non è necessaria
qualsiasi ulteriore dimostrazione.
In Inglese
– È evidenza qualsiasi cosa che ha provato la
propria validità mediante il metodo scientifico.
È evidenza ciò che è stato dimostrato.
Evidenza = Prova di efficacia ?
GIMBE®
Evidence-Based Nursing (EBN)
Processo per mezzo del quale le infermiere e
gli infermieri assumono le decisioni cliniche
utilizzando le migliori ricerche disponibili, la loro
esperienza clinica e le preferenze del paziente,
in un contesto di risorse disponibili.
Di Censo A, Cullum N, Ciliska D.
“Implementing evidence based nursing: some misconceptions”
Evidence Based Nursing 1998; 1:38-40
GIMBE®
Evidence-Based Nursing (EBN)
Opinion Based Nursing – O.B.N.
Evidence Based Nursing – E.B.N.
GIMBE®
Perché la ricerca
A
t
t
i
v
i
t
à
c
l
i
n
i
c
a
• Individuare gli ambiti dove c’è variabilità
• Dove c’è variabilità, c’è incertezza
• Incertezza = area dove è necessario
fare chiarezza
• Per fare chiarezza bisogna attivare la
ricerca
• La ricerca è il motore che produce
conoscenza quale riferimento per
guidare la pratica professionale
Paolo Chiari - Centro Studi EBN
S.Orsola-Malpighi - Bologna
GIMBE®
Quando la ricerca e quando l’EBN
Quando abbiamo un dubbio ricorrere
all’EBN, prima di pensare alla conduzione di
uno studio, è senz’altro una precauzione
che può farci risparmiare molto tempo ed
energie a patto che non siamo nell’area
grigia.
GIMBE®
Il processo dell’EBN
Avere chiaro rispetto ad un problema
assistenziale cosa voglio cercare, perché, e
quali risultati devo andare a misurare (aree
di incertezza)
Porre le proprie domande alle banche dati,
ricercando quelle che sono le conoscenze
scientifiche disponibili
Valutare criticamente i risultati
Applicazione delle conoscenze acquisite al
problema originario
GIMBE®
EBN – Strumento & Processo
Selezionare un aspetto del proprio lavoro da
mettere in discussione (bisogno di informazione)
Trasformare il bisogno di informazione in quesiti
clinici ben definiti
Ricercare con la massima efficienza, attraverso le
fonti bibliografiche ed elettroniche le migliori
evidenze disponibili
Valutare criticamente le evidenze trovate, la loro
validità ed applicabilità
Diffondere ed integrare le evidenze trovate
Miglioramento continuo della pratica professionale
GIMBE®
I limiti dell’Evidence
Based Practice
GIMBE®
I limiti della EBM
Esistenza di “zone grigie”:
Zone in cui vi è incertezza sull’efficacia.
EBM si basa sulle migliori evidenze
“disponibili”, non sulle migliori evidenze
“possibili”.
GIMBE®
Le “aree grigie”: lo spazio dell’opinion based
RICERCA
Interventi
efficaci
Area grigia
PRATICA CLINICA
EBP
Copyright © 1996-2007 - GIMBE®
GIMBE®
Interventi
inefficaci
I limiti dell’EBP nella realtà italiana
Necessità di acquisire nuove competenze
Mancanza di tempo
Scarsa attitudine a mettere in discussione le proprie
conoscenze e a generare quesiti clinicoassistenziali.
Barriere linguistiche
Limitata diffusione/accesso agli strumenti informatici
(computer, internet)
Scarsa familiarità informatica
Resistenza al cambiamento
Scarsa disponibilità di riviste
GIMBE®
Le caratteristiche della
letteratura biomedica
GIMBE®
The Ascent of Evidence
(and the exhaustion of man)
GIMBE®
Clarke M.J. BMJ 1998
Le caratteristiche dell’informazione
biomedica
Crescita esponenziale:
circa 2.000.000 di articoli pubblicati ogni
anno in oltre 20.000 riviste
sino a 400.000 citazioni aggiunte
annualmente au MEDLINE
GIMBE®
Le caratteristiche dell’informazione
biomedica
Obsolescenza:
Progresso scientifico e tecnologico.
Ritardo nella disseminazione ed utilizzazione delle
conoscenze.
Frammentazione:
Articoli che trattano lo stesso argomento sono
pubblicati in riviste ed annate diverse.
Basso rapporto segnale/rumore:
La maggior parte della produzione scientifica è
irrilevante per la pratica clinica.
GIMBE®
WARNING !!!
Trials are conducted and published without a systematic
review of existing evidence, meaning that trials are
conducted unnecessarily or don't address the most
important questions
Results are selectively published with positive results
emphasised or published more than once, and negative
results are ignored
Trials favourable to sponsors are published in major
journals, and unfavourable trials are not published at all
or are published in minor journals
Smith R, Roberts I. Patient Safety Requires a New Way to Publish Clinical Trials. PLoS Clinical Trials (2006)1(1)
GIMBE®
WARNING !!!
Una ricerca effettuata su 3.600 ricercatori e 4.000
borsisti ha dimostrato che:
0,3% ha inventato o falsificato dati
6% non ha pubblicato dati che erano in contrasto con le
sue ricerche precedenti
15,5% ha cambiato il disegno della ricerca per
accontentare lo sponsor
4,7% ha pubblicato più volte gli stessi risultati
15,3% non ha utilizzato tutti i risultati ottenuti
Nature 2005; 435: 718-9
GIMBE®
EBN
Fasi del processo:
Conversione del bisogno di informazione in
quesiti clinici ben definiti.
; Ricerca con la massima efficienza delle migliori
evidenze disponibili.
; Valutazione della loro validità ed applicabilità
clinica.
; Integrazione delle evidenze nella pratica clinica.
; Rivalutazione continua della propria performance
professionale.
;
GIMBE®
Approcci di ricerca
Cercare attivamente nelle banche dati
rispetto ad un quesito formulato
(SEARCHING).
Consultare
periodicamente
alcune
riviste per individuare novità sui temi di
abituale interesse (SCANNING).
GIMBE®
Gestione dell’informazione
SCANNING
cerco ciò che
sorveglianza
mi serve
SEARCHING
GIMBE®
SCANNING
Seleziono alcune riviste e le leggo in modo
regolare
•che cosa trovo?
– abstract
– full text
•problemi
– time consuming
– aspecificità
GIMBE®
SEARCHING
• Risorse primarie
– B.D. generali: Medline, Embase, …
– B.D. specialistiche: Cinhal, Pedro,
Joanna Briggs, …
• Risorse secondarie:informazioni predigerite
– B.D. revisione sistematiche: Cochrane
library, Joanna Briggs, …
– B.D. linee guida: New Zeland guideline
group, Sign, National Guideline
Clearinghouse, …
GIMBE®
IL QUESITO
ovvero
Come definire un quesito clinico che ci permetta di
ricercare la risposta nella letteratura
GIMBE®
Focalizzando la domanda si:
chiarisce l’obiettivo della ricerca
facilita
l’utilizzo
di
regole
appropriate nella scelta dei titoli e
dei riassunti degli articoli che sono
stati individuati
GIMBE®
FORMULARE DOMANDE PERTINENTI
ALLE QUALI SIA POSSIBILE
RISPONDERE
GIMBE®
I quesiti clinico-assistenziali
appartengono a due categorie principali
Background
Foreground
Conoscenza, esperienza
Modificato da: Guyatt GH, et al. AMA Press, 2001
GIMBE®
Perché ci interessa questa distinzione?
Per orientare diversamente la ricerca
Background questions
Sono i quesiti di base che vengono formulati nelle situazioni in cui
l'argomento è poco, o per nulla, conosciuto.
E' un "bisogno di informazione" caratteristico del giovane professionista.
Il professionista esperto, peraltro, formula quesiti di base quando la
malattia/condizione/tecnologia è:
– estranea al proprio settore professionale/specialistico
– di recente introduzione (tecnologia) o descrizione (malattia)
– rara
Verso linee guida
GIMBE®
Perché ci interessa questa distinzione?
Per orientare diversamente la ricerca
Foreground questions
Sono quesiti specifici, generalmente formulati dal
professionista esperto.
Verso la letteratura secondaria: Revisioni sistematiche
Verso la letteratura primaria: Medline, Cinahl, Embase
GIMBE®
Quando nasce un quesito di
foreground?
Categoria
Eziologia/Rischio
Diagnosi
Quesito ed esempio
Qual'è la responsabilità eziologica del fattore di rischio X nell'insorgenza della malattia Y?
L'utilizzo del telefono cellulare aumenta il rischio di neoplasia cerebrale?
Qual'è l’accuratezza del test diagnostico X (rispetto al gold-standard Y) nella diagnosi
della malattia Z?
Qual'è l'accuratezza diagnostica della risonanza magnetica nucleare nei pazienti con
sospetta lesione del menisco?
Qual'è la storia naturale della malattia X e la potenza dei fattori prognostici?
Prognosi
In un paziente con neurite ottica, quali sotto i fattori prognostici (favorevoli e sfavorevoli)
che condizionano l'evoluzione in sclerosi multipla?
Qual'è l'efficacia del trattamento X (preventivo, terapeutico o riabilitativo), rispetto al
trattamento Y, nella malattia Z?
Trattamento
GIMBE®
In un paziente con osteoartrosi in trattamento cronico con FANS (malattia/condizione),
l'omeprazolo (intervento), rispetto al misoprostolo (confronto) è in grado di prevenire
l'ulcera da FANS sintomatica e le sue complicanze (evento)?
Quesito di foreground
formulare il quesito in modo preciso ed
articolato (es PICOM)
P
I
C
O
M
GIMBE®
patient
intervention
comparison
outcome
metodology
Esempio di ricerca
Punto di partenza: quesito in forma
“narrativa”:
“Il saccarosio può essere impiegato come
analgesico per i neonati che devono essere
sottoposti a procedure dolorose
(es.: puntura del calcagno …)?”
GIMBE®
Trasformazione con PICOM
Patient: neonati da sottoporre a procedure
dolorose.
Intervention: analgesia con saccarosio.
Comparison: altri trattamenti non farmacologici
(“pacifiers”, ecc.).
Outcome: ridotta percezione del dolore da parte
del neonato.
Metodology: Randomized Controlled Trial
GIMBE®
PER OGNI DOMANDA UN DISEGNO
QUESITO
GIMBE®
DISEGNO DELLO STUDIO
EZIOLOGIA
• COORTE
• CASO CONTROLLO
• REVISIONI SISTEMATICHE
PROGNOSI
• STUDI LONGITUDINALI DI COORTE
(SENZA COORTE CONCORRENTE)
DIAGNOSI
• STUDI TRASVERSALI DI CONFRONTO
INDIPENDENTE E CIECO CON IL GOLD
STANDARD DIAGNOSTICO
TRATTAMENTO
• RCT
Paziente
o
problema
Chiedersi:
“come descriverei un gruppo di pazienti
simili al mio?”
Bilanciare la precisione con la brevità
GIMBE®
Intervento
Chiedersi
o
“Quale intervento principale/fattore di
esposizione
rischio sto prendendo in
(una causa, un
considerazione?”
fattore prognostico,
un trattamento)
Essere specifici
GIMBE®
Intervento di
controllo
(se necessario
e/o possibile)
GIMBE®
Chiedersi
“Qual’è la principale alternativa da
comparare all’intervento prescelto?”
Essere specifici
Outcome
Chiedersi
“Che cosa spero di ottenere?”
o
“Che cosa questo intervento può
realmente produrre?”
Essere specifici
GIMBE®
Metodology
GIMBE®
Chiedersi
“Quale disegno di studio costituisce
il gold standard per il mio quesito?”
Perché è importante chiarire bene il
quesito?
Per due motivi fondamentali:
–
–
GIMBE®
dal quesito vengono estratte le parole chiave
che useremo nella ricerca.
dalla chiara espressione del quesito possiamo
definire i criteri di inclusione o di esclusione,
tra i quali capire quale tipo di studio dobbiamo
cercare.
Esercitazione del quesito: diagnosi
Lavori come infermiere all’interno di un ambulatorio
di assistenza di base. Usi abitualmente diversi
strumenti validati per la diagnosi di depressione.
Una collega ti parla di uno strumento analogo, ma
composto da due sole domande, che secondo lei è
efficace per la individuazione della depressione
maggiore ed il cui uso, essendo così semplice,
permetterebbe di risparmiare molto tempo. Decidi
quindi di compiere una ricerca su diversi strumenti
per la diagnosi di depressione e sulle loro proprietà.
GIMBE®
Esercitazione del quesito
Popolazione Intervento Comparazione Outcome
Rispetto ad
L’uso di
In pazienti
altri strumenti
con sospetta uno
depressione strumento validati
a due
domande
GIMBE®
Che livello
di
accuratezza
permette?
Esercitazione del quesito: prognosi
Stai assistendo una paziente di 28 anni che è appena
stata sottoposta a dilatazione e curettage dopo aborto
spontaneo. Era di 10 settimane ed era alla sua prima
gravidanza. In un incontro di team, uno dei tuoi
colleghi ricorda vagamente di aver letto un articolo
inerente la depressione dopo l’aborto. Decidi di
compiere una ricerca per verificare se nella tua
paziente sono riscontrabili i fattori prognostici di severa
e prolungata depressione.
GIMBE®
Esercitazione del quesito: prognosi
Popolazione
Intervento
In una
paziente sana
che ha
recentemente subito un
aborto
L’individuazione di
quali fattori
prognostici
GIMBE®
Comparazio- Outcome
ne
Consente
di
individuare
uno stato
depressivo
non
fisiologico
Esercitazione del quesito: terapia
Lavori
come
infermiere
scolastico
e
visiti
quotidianamente molti bambini delle scuole elementari
e medie (di età compresa tra i 5 ed i 13 anni). È
inverno, per cui sono molto diffusi influenza e
raffreddore. Una delle insegnanti ti ferma nell’atrio
della scuola e ti chiede un consiglio per la sua
bambina di 10 anni che è raffreddata. L’insegnante ha
sentito dire che lo zinco in compresse può aiutare ad
alleviare i sintomi del raffreddore: ti chiede se è
realmente efficace e se è opportuno darlo ai bambini.
GIMBE®
Esercitazione del quesito: terapia
Popolazione Intervento
Nei bambini
con
raffreddore
GIMBE®
Comparazio- Outcome
ne
Sono
L’assunzioefficaci e
ne di
?
sicure per
compresse
alleviare i
di zinco
sintomi del
raffreddore?
Esercitazione del quesito: eziologia
Sei un infermiere territoriale. Una tua paziente è
una donna di 29 anni che ha assunto contraccettivi
orali negli ultimi 6 anni. Ti telefona per chiederti un
consiglio: è molto in ansia dopo avere letto che
l’assunzione degli estroprogestinici aumenta il
rischio di malattie cardiovascolari. Lei non ha mai
fumato, è in buona salute e non ci sono casi di
malattie cardiache nella sua famiglia. Chiede di
vederti per discutere se è il caso di sospendere
l’assunzione della pillola e passare all’uso di un altro
metodo anticoncezionale. Nel prepararti all’incontro
decidi di eseguire una ricerca.
GIMBE®
Esercitazione del quesito: eziologia
Popolazione Intervento
Comparazione
Nelle donne L’assunzione Rispetto ad
altri metodi
in età fertile di
contraccettivi contraccett
orali a base ivi non
estroproge
di
estroprogesti stinici
nici
GIMBE®
Outcome
Aumenta il
rischio di
insorgenza di
malattie
cardiovascolari?
GIMBE
THE END
GIMBE®
®
Gabriele Manzi
GIMBE®
Architettura della
ricerca clinica
Studi Epidemiologici
GIMBE®
BIOMEDICA
RICERCA
PRIMARIA
• Studi
Osservazionali
• Studi
Sperimentali
RICERCA
SECONDARIA
FONTI
TERZIARIE
• Revisioni
Sistematiche
• Linee Guida
• Analisi
economiche
• Revisioni
tradizionali
• Trattati
• Editoriali
• Parere degli
esperti
GIMBE®
The 6S hierarchy of
pre-appraised evidence
Computerized decision support systems
Evidence based clinical practice guideline / textbooks
Evidence based abstraction jourmal
Systematic reviews
Evidence based abstraction jourmal
Original jourmal articles
DiCenso A, Bayley L, Haynes RB. Evid. Based Nurs. 2009;12;99-101
Ricerca sanitaria
Systematic Review
Evidence
Randomized
Controlled
Studies
Mega Trial
Cohort Studies
Case Control Studies
Case Series
Case Reports
Ideas, Editorials, Opinions
Non
evidence
GIMBE®
Animal research
In vitro (‘test tube’) research
Architettura della ricerca clinica
• Studi osservazionali
–descrittivi
• case report
• case series
GIMBE®
• Studi osservazionali
– Analitici
• Studi caso controllo
• Studi di coorte
–Con coorte parallela
»Prospettici
–Senza coorte parallela
»Prospettici retrospettivi
• Studi trasversali/cross sectional
GIMBE®
Quali sono?
QUESITO
COORTE
EZIOLOGIA
PROGNOSI
CASO
CONTROLLO
REVISIONI SISTEMATICHE
STUDI
LONGITUDINALI DI COORTE
STUDI
DIAGNOSI
TRATTAMENTO
TRASVERSALI DI CONFRONTO
RCT
REVISIONI
SISTEMATICHE
Per ogni domanda un disegno
GIMBE®
Studi che verificano l’eziologia
• Es: vogliamo verificare se avere un indice
di massa corporea basso (BMI)
(esposizione) è un fattore di rischio per le
lesioni da decubito (esito).
GIMBE®
Come posso rispondere a questa
domanda?
Attraverso tre disegni di studio:
studi trasvesrale/cross sectional
studi caso controllo
studi di coorte
GIMBE®
Studio Cross Sectional
Studio che raccoglie informazioni relative
all’esposizione ai fattori di rischio ed ai loro esiti
(positivi o negativi) NELLO STESSO
MOMENTO e sullo stesso paziente.
GIMBE®
Studio Cross Sectional
Sono studi che rilevando i dati in un
preciso momento realizzano la fotografia
del fenomeno.
Attraverso questi studi si rileva la
PREVALENZA di una patologia, ossia
l’insieme di tutti i casi esistenti in un
determinato momento, in una determinata
popolazione.
GIMBE®
Studio Caso-Controllo
Sono studi retrospettivi che partendo dalla
presenza, o meno, dell’esito (endpoint)
raccolgono
informazioni
relative
all’esposizione ai fattori di rischio.
Oggi
Fattori di
rischio
GIMBE®
End point sì
End point no
Studio di Coorte
Sono
studi
prospettici
che
partendo
dall’esposizione ai fattori di rischio o meno,
verificano se insorge l’esito.
Oggi
Fattori di
rischio sì
Fattori di
rischio no
GIMBE®
End point
Studi osservazionali analitici
esposti
outcome +
CASI
ESPOSTI
non esposti
outcome Soggetti
dello studio
esposti
outcome +
CONTROLLI
NON ESPOSTI
non esposti
outcome direzione dell’osservazione
CASO-CONTROLLO
COORTE PROSPETTICA
Cosa ci serve sapere per
leggere gli studi
osservazionali?
Dove può essere
l’errore/imbroglio?
GIMBE®
Individuare i bias
In uno studio il termine bias si riferisce a
qualsiasi errore sistematico che porti ad
una stima errata dell’associazione tra
l’esposizione e l’esito.
In uno studio osservazionale i puù
frequenti bias che possiamo incontrare
sono:
– Information bias
– Recall bias (studi caso controllo)
– Selection bias
– Fattori di confondimento
GIMBE®
Information bias
Information bias/misclassification – si realizza
tutte le volte in cui il paziente è posto nel
gruppo sbagliato a causa della errata
rilevazione del parametro considerato (es.
misurazione errata di peso ed altezza e
collocazione erronea nel gruppo sbagliato).
GIMBE®
Information bias
Esso si riduce in due modi:
– Utilizzando misure di rilevazione oggettive.
– Facendo in modo che chi effettua la
rilevazione dell’esposizione non sia a
conoscenza dell’esito (studio caso controllo) e
che chi rileva l’esito non sia a conoscenza
della condizione di esposizione all’agente
indagato (studio di coorte).
GIMBE®
Recall bias
E’ proprio degli studi caso controllo e si
verifica a causa dell’errato ricordo dei
soggetti coinvolti nello studio.
GIMBE®
Esempio :
confronto tra
quanto riferito dai
pazienti e quanto
rilevato dalla
documentazione
clinica.
Coloro che non hanno
sviluppato ldp hanno un
ricordo meno accurato e
più positivo rispetto la
durata della propria
degenza (in realtà 13
anziché 10 hanno avuto
degenze >10 gg).
GIMBE®
Esempio:
confronto tra
quanto riferito dai
pazienti e quanto
rilevato dalla
documentazione
clinica.
Questo tipo di bias
modifica in modo
consistente i risultati
della associazione,
spostandola
dall’ipotesi nulla!
GIMBE®
Selection bias
I soggetti selezionati devono essere identici ad
eccezione dell’esposizione all’agente eziologico.
Quando questo non si realizza siamo in presenza
del SELECTION BIAS.
In uno studio caso controllo, in cui si parte da due
gruppi con e senza l’esito, cosa accadrebbe se si
fossero scelte solo persone non esposte al reale
agente eziologico?
In uno studio di coorte, l’allocazione alla condizione
di esposizione si riferisce allo sviluppo dell’esito (o
nei due gruppi ci sono altre differenze non
considerate)?
GIMBE®
Follow up
Negli studi di coorte i soggetti inclusi nei
gruppi devono essere seguiti per un
periodo di tempo sufficientemente lungo.
Cosa accadrebbe se il follow up fosse
troppo breve, o in esso fossero persi molti
soggetti?
Una perdita nel follow up superiore al 20%
compromette i risultati dello studio.
GIMBE®
Se nel compimento di uno studio è stato
effettuato un bias i suoi risultati possono
essere definitivamente compromessi.
Gli studi di coorte da questo punto di vista
sono più sicuri per qul che riguarda
l’information bias o il recall bias, ma non
sempre sono realizzabili (condizioni rare o
a lentissimo decorso).
GIMBE®
Errors in epidemiological studies
Error
Random error (chance)
Systematic error (bias)
Study size
GIMBE®
Rothman, 2002
Fattori di confondimento
Esiste una relazione tra l’assunzione di
caffè e l’insorgenza del tumore del
polmone?
GIMBE®
I fattori di confondimento sono le variabili
associate sia all’esposizione, sia all’esito,
che potrebbero spiegare qualsiasi
relazione osservata tra questi due fattori.
Esposizione
Esito
Fattore di
confondimento
GIMBE®
Al contrario dei bias, i fattori di
confondimento possono essere corretti
cercando di escludere dal campione
coloro che possono recarli con sé.
Purtroppo però i fattori di confondimento
sono numerosi, per cui è difficile escluderli
completamente dagli studi (e non è detto
che possano essere identificati).
GIMBE®
Per ridurre l’effetto delle variabili di
confondimento esistono diverse tecniche
in fase di campionamento o in fase di
analisi.
GIMBE®
Accorgimenti in fase di
campionamento
Esclusione dei soggetti che presentano variabili
di confondimento.
Matching negli studi caso controllo, che consiste
nell’introdurre nel gruppo di controllo persone
con le stesse variabili di confondimento presenti
nel gruppo dei casi.
Stratificazione, che consiste nell’esaminare
l’associazione in modo separato nei diversi
gruppi.
GIMBE®
Accorgimenti in fase di analisi
Regressione.
Si tratta di un’elaborazione statistica che
ricalcola le associazioni tenendo conto
dell’effetto delle variabili di
confondimento.
®®
GIMBE
GIMBE
Centro Studi EBN - Bologna
Esempio
Esiste una relazione tra il sesso maschile
e l’insorgenza delle lesioni da decubito?
Lo studio di Margolis e al, rispetto al
genere maschile e le lesioni da decubito,
presenta un rischio relativo non adattato
pari a 0,78, (CI 95% 0,70-0,88), quindi per
gli uomini esisterebbe il 22% in meno di
rischio di sviluppare lesioni da decubito.
GIMBE®
Esempio
Tuttavia, considerando i fattori potenziali
di confondimento, quali l’età e le
patologie concomitanti, il rischio relativo
diventa 1,01 (CI 95% 0,89-1,15).
Quindi, cosa si può concludere?
GIMBE®
Misure di associazione
• Negli studi sulle cause, vogliamo
conoscere la relazione tra alcuni fattori di
rischio dei pazienti (le esposizioni) e una
particolare condizione o patologia (l’esito).
• La relazione tra i fattori di rischio e gli esiti
viene generalmente presentata come
rischio relativo, negli studi di coorte, o
come odds ratio negli studi caso controllo.
GIMBE®
Tabellare i dati
Evento Sì Evento No
Gruppo E
a
b
a+b
Gruppo C
c
d
c+d
a+c
b+d
L’odds ratio (OR)
a/b
OR = -----------c/d
GIMBE®
Rischio Relativo (RR)
a
EER
---------a+b
RR = ------------ = ------------c
CER
---------c+d
Esempio ipotetico
Rischio Relativo (RR)
a
EER
----------0,073
a+b
RR = ------------ = --------------- = ----------- = 1,37
c
CER
---------0,053
c+d
GIMBE®
Un risk ratio di 1,37
indica che le persone
con un BMI basso
hanno un rischio di
1,37 volte maggiore (o
del
37%)
di
presentare
l’esito,
rispetto a quelli con
elevato BMI.
Esempio ipotetico
Odds Ratio (OR)
a/b
0,078
OR = ------------ = ----------------- = 1,39
c/d
0,056
GIMBE®
Un odds ratio di 1,39
indica che le persone
con un BMI basso
hanno una probabilità
di 1,39 volte maggiore
(o
del
39%)
di
presentare
l’esito,
rispetto a quelli con
elevato BMI.
Misure di associazione
• Il RR si utilizza negli studi di coorte.
• L’OR si utilizza negli studi caso controllo.
• Non è tanto importante capire quando
bisogna usare l’uno o l’altro, ma sapere
che cosa indicano.
GIMBE®
Interpretazione
• RR/OR<1 - l’esposizione protegge rispetto
l’insorgenza del danno
• RR/OR>1 – l’esposizione favorisce
l’insorgenza del danno
• RR/OR=1 – l’esposizione non produce
effetti
GIMBE®
Intervallo di confidenza (IC)
GIMBE®
Ipotesi nulla
Si verifica quando i parametri di associazione
(RR o OR) assumono valore uguale a 1 o
quando l’intervallo di confidenza comprende
il valore 1.
GIMBE®
I risultati sono validi?
Per valutare la validità di uno studio
sull’eziologia (danno) di una particolare
condizione, abbiamo bisogno di valutare
quanto abbiano influito, sui risultati dello
studio, 3 fattori:
il caso
i bias
i fattori di
confondimento
GIMBE®
Test di significatività statistica
… lo zampino del caso
Oltre a verificare i valori delle misure di
associazione, e del loro IC, occorre
osservare il valore assunto da “p”, che
ci dice quale sia la significatività
statistica dei risultati ottenuti.
GIMBE®
Significatività statistica
GIMBE®
Per ogni quesito un disegno di
studio ideale
QUESITO
EZIOLOGIA
COORTE
CASO CONTROLLO
PROGNOSI
STUDI LONGITUDINALI DI COORTE
DIAGNOSI
STUDI TRASVERSALI DI CONFRONTO
TERAPIA
RCT
GIMBE®
Ricerca sanitaria
Systematic Review
Evidence
Randomized
Controlled
Studies
Mega Trial
Cohort Studies
Case Control Studies
Case Series
Case Reports
Ideas, Editorials, Opinions
Non
evidence
GIMBE®
Animal research
In vitro (‘test tube’) research
• Studi sperimentali
– Non controllati
– Controllati
• Non randomizzati
• Randomizzati
GIMBE®
Popolazione e campione
Il campione viene definito individuando
criteri di inclusione e di esclusione. Essi
sono utilizzati per selezionare il campione
dall’insieme di tutte le unità.
GIMBE®
Campione
Il campione deve essere corrispondente
alle caratteristiche definite dal ricercatore
e deve essere rappresentativo in termini
quali-quantitativi della popolazione.
Solo in questo modo sarà possibile trarre
inferenze (generalizzazioni) sulle
popolazione dallo studio effettuato su una
piccola parte di essa.
GIMBE®
Campione
Un campione statisticamente
rappresentativo deve essere
sufficientemente numeroso, omogeneo e
scelto a caso.
GIMBE®
Campionamento casuale
Per ottenere delle informazioni attendibili su una
popolazione utilizzando un campione, è
necessario che il campione sia ottenuto in modo
casuale.
Perché un campione possa essere considerato
casuale, è indispensabile che ogni membro della
popolazione in oggetto abbia la stessa
probabilità di essere selezionato.
Un campione non casuale non dà alcun
affidamento: in altre parole tutte le inferenze da
esso tratte non sono da prendersi sul serio.
GIMBE®
Campionamento casuale
Al fine di realizzare un campionamento
casuale occorre ricorrere alla
randomizzazione.
Essa può essere realizzata in diversi modi:
– Randomizzazione semplice
– Randomizzazione stratificata
– Randomizzazione a grappolo
– Randomizzazione sistematica
GIMBE®
• Studi sperimentali
– Non controllati
– Controllati
• Non randomizzati
• Randomizzati
GIMBE®
Trial Randomizzati Controllati
Esposizione
Outcome
Gruppo
sperimentale
Campione
di
popolazione
No outcome
Randomizzazione
Outcome
Gruppo
controllo
No outcome
Direzione della raccolta dati
Esposizione
GIMBE®
Outcome
Anatomia
dell’RCT
Outcome
Gruppo Sperimentale
No
Outcome
Outcome
Gruppo di Controllo
GIMBE®
No
Outcome
La cecità per vedere bene
Singolo cieco: solo il soggetto sotto sperimentazione non
sa quale trattamento sta ricevendo
Doppio cieco: sia il soggetto sotto sperimentazione che
gli sperimentatori non sanno quale trattamento si
assume/somministra
Doppio simulato: metodica utilizzata per mantenere la
cecità quando si confrontano farmaci somministrati per
vie diverse tra di loro
In aperto: sperimentazione senza cecità. Accettabile
quando si confrontano terapie diverse e/o complesse
GIMBE®
La cecità per vedere bene
¾ Non conoscere il trattamento che si riceve (paziente)
o che si somministra (sperimentatore) impedisce di
essere influenzati dalle aspettative che si hanno
rispetto al trattamento
¾ Altrettanto
importante è essere ciechi nella
valutazione dei risultati della sperimentazione,
soprattutto se gli end-points sono sfumati o di difficle
interpretazione (triplo cieco)
GIMBE®
I RCTs rappresentano lo standard sperimentale per valutare
l'efficacia dei trattamenti, grazie alla loro capacità di
minimizzare i bias rispetto ad altri disegni sperimentali
L'elemento che contraddistingue i RCTs è l'assegnazione
casuale dei pazienti al trattamento da sperimentare o al
trattamento di controllo, attraverso le procedure di
randomizzazione.
Questa assicura che tutti i fattori prognostici - sia noti che non
conosciuti - vengono equamente distribuiti nei due gruppi di
pazienti, tra i quali l’unica differenza risulta essere l’intervento
terapeutico in studio.
GIMBE®
Se lo studio è condotto ed analizzato in maniera adeguata,
l’eventuale differenza di esiti tra i due gruppi può essere
attribuita all’effetto del trattamento in studio.
Esistono alcune condizioni per cui i RCTs non costituiscono il
miglior disegno sperimentale:
- storia naturale delle malattie Æstudi di coorte
- eziologia, fattori di rischio Æ studi di coorte, studi casocontrollo
- accuratezza diagnostica Æ studi trasversali
GIMBE®
Fasi degli studi sui
farmaci
Da dove nascono gli RCTs
GIMBE®
Fasi degli studi farmacologici
• Nella fase I viene definita la massima dose
tollerata e gli effetti tossici di un nuovo
farmaco già sperimentato in laboratorio su
animali (si svolgono pochi pazienti, <30).
• Lo scopo della fase II è stabilire l’efficacia
del farmaco in un gruppo maggiore e
mirato di pazienti (massimo 100).
GIMBE®
Fasi degli studi farmacologici
• La fase III rappresenta i tipici effettivi trials,
in quanto si cerca di comparare il nuovo
farmaco con un farmaco esistente o un
intervento ritenuto efficace.
• La fase IV riguarda gli studi che cercano di
monitorare gli effetti indesiderati di un
nuovo farmaco dopo che è stato
approvato per il marketing.
GIMBE®
Necessità di ricorrere
agli RCTs
Ma è proprio indispensabile
confrontare i risultati di due
gruppi?
GIMBE®
Trial non controllati
Il trattamento sperimentale viene
assegnato a tutti i pazienti eleggibili
consecutivamente osservati. Sono ad
esempio le serie di casi o gli studi di fase I
e II.
GIMBE®
In assenza del gruppo di controllo, i
benefici del trattamento sperimentale
vengono misurati confrontando i risultati
ottenuti al decorso della malattia non
trattata (o trattata con terapia standard).
I benefici del trattamento sperimentale
vengono misurati riportando i benefici
assoluti del trattamento (ad es. il 98% dei
pazienti con polmonite trattati con
l’antibiotico X è guarito)
GIMBE®
Pazienti eleggibili
GIMBE®
Esposizione:
Intervento
sperimentale
Outcome
Follow up
• Principali fonti bias sono:
- variabilità prognostica e di decorso della
malattia.
- effetto placebo.
- aspettative ottimistiche del medico e del
paziente.
GIMBE®
L’effetto totale del trattamento è la somma
del miglioramento spontaneo, delle risposte
aspecifiche e della risposta specifica
Effetto reale del trattamento
Effetto placebo
Aspettative ottimistiche
Effetto Hawthorne
Miglioramento spontaneo
GIMBE®
Effetto
reale
del
trattamento
Effetto reale del trattamento
Dipende dai benefici o dai danni indotti dal
trattamento in uso, può essere influenzato
anche dalla storia naturale della malattia che
porterà il paziente a migliorare o peggiorare
comunque
GIMBE®
Effetto placebo
Induce il paziente a stare meglio per la
consapevolezza di essere trattato
L’effetto placebo può essere maggiore se il
paziente ha fiducia nel medico o nella
struttura sanitaria a cui si rivolge o se ha
grande speranza che il nuovo trattamento
sia efficace
GIMBE®
Apettative ottimistiche
Le aspettative ottimistiche del medico
derivano dalla sua fiducia nel trattamento
che sta sperimentando e che lo inducono a
interpretare ottimisticamente i risultati
terapeutici accentuando l’effetto placebo
GIMBE®
Effetto Hawthorne
Induce il paziente a stare meglio per la
consapevolezza di essere parte di uno
studio
GIMBE®
In una certa epoca storica hanno prodotto
evidenze inconfutabili per trattamento
drammatica efficacia in condizioni ad esito
invariabilmente fatale/sfavorele:
– insulina nel coma diabetico.
– penicillina nella polmonite pneumococcica (o
nella endocardite batterica).
– vitamina B12 nell’anemia perniciosa.
– appendicectomia nell’appendicite acuta.
GIMBE®
Oggi si utilizzano per:
– efficacia del nuovo trattamento
drammaticamente evidente in osservazioni
iniziali.
– patologie a decorso sfavorevole (arresto
cardiaco, epatite fulminante, …).
– patologie rare.
– studi di fase I e II.
– assenza di trattamenti alternativi di confronto.
– presupposti fisiopatologici convincenti.
Byar DP. NEJM 1990; 323: 1343
GIMBE®
• Smith GC, Jill JP.
Parachute use to prevent death and major
trauma related to gravitational challenge:
systematic review of randomised
controlled trials. BMJ, Dec 2003; 327:
1459 - 1461.
GIMBE®
Parachutes reduce the risk of injury
after gravitational challenge, but their
effectiveness has not been proved
with randomized contolled trials.
As with many interventions intended to prevent ill
health, the effectiveness of parachutes has not been
subjected to rigorous evaluation by using randomised
controlled trials.
Advocates of evidence based medicine have criticised
the adoption of interventions evaluated by using only
observational data.
We think that everyone might benefit if the most
radical protagonists of evidence based medicine
organised and participated in a double blind,
randomised, placebo controlled, crossover trial of the
parachute.
GIMBE®
Mega-Trial
• Sempre più frequenti
• Dimensione del campione > 1.000 - > 10.000
• Multicentrici >100 - > 1.000
• Criteri di inclusione larghi
• Disegno semplice, registrati solo dati essenziali
• End points non equivocabili (es: mortalità)
• Grande potenza statistica: possono evidenziare
differenze di efficacia minime fra i trattamenti a
confronto.
Topol EJ & Califf RM. Br Heart J 1992; 68: 348
GIMBE®
Dai trial clinici ai mega-trial
Magnesio nell’infarto miocardico
Razionale: variazione nell’andamento di patologie cardiache in funzione
della quantità di magnesio nell’acqua
Studi su animali hanno mostrato l’attività antiaritmica, antiaggregante e
coronarodilatatrice del magnesio
Piccoli trials positivi sull’uso del magnesio nell’infarto
Una review informale dei risultati ha mostrato una riduzione della
mortalità da IMA (infarto acuto del miocardio)
Una meta-analisi formale su 1300 pazienti con un totale di 78 decessi
ha mostrato una riduzione del 55% nel rischio di morte (p=0.001)
Studio LIMIT-1 su 100 paz. - diminuzione delle aritmie (1986)
Studio LIMIT-2 su 2300 paz. - diminuzione dell’incidenza di insuff.
ventricolare sinistra (1994)
GIMBE®
Il mega-trial ISIS-4 (1995)
58.050 pazienti entro 24 ore dall’infarto in 1086 ospedali
- captopril per 1 mese vs placebo
- nitoderivati per un mese vs placebo
- magnesio solfato IV 24 h vs controllo
Risultati
Il captopril previene 5 morti su 1000, i nitroderivati e il
magnesio sono inefficaci
GIMBE®
Lancet, 1995 Mar 18;345(8951):669-85.
GIMBE®
Trial controllati non randomizzati
Sono caratterizzati dalla presenza di un gruppo di pazienti di controllo, i cui esiti vengono confrontati con quelli del gruppo dei pazienti trattati
L'assegnazione dei pazienti all'uno o all'altro gruppo avviene con una delle seguenti procedure:
‐ controlli paralleli (contemporanei) ‐ controlli storici
‐ controlli da banche dati
GIMBE®
I limiti comuni a tutti i CCTs sono:
• lo sbilanciamento dei fattori prognostici tra i due gruppi di pazienti
• la tendenza ad assegnare al trattamento sperimentale i pazienti a prognosi più favorevole
• la sovrastima dell'efficacia del trattamento sperimentale, accentuata nelle CCTs con controlli storici dal miglioramento delle condizioni assistenziali
Per tali ragioni le CCTs, non dovrebbero più occupare alcun ruolo nella moderna ricerca clinica (vedi Consort Statement II)
Costituiscono a tutti gli effetti una frode scientifica
Copyright © - GIMBE®
Cosa dobbiamo sapere
quando leggiamo gli
RCT
Dove può essere
l’imbroglio?
GIMBE®
Bias negli RCTs
• Benchè costituiscano gli studi più rigorosi,
in essi la presenza di errori sistematici
(bias), compromette la credibilità e
l’autorevolezza dei risultati.
• I più comuni bias possono essere:
– bias di selezione
– bias di accertamento
– bias di violazione del protocollo
– bias di pubblicazione
– bias temporali
GIMBE®
Selection bias
• Accade quando gli outcome di uno studio
sono influenzati dalle differenze
sistematiche (reali e non casuali) dei due
gruppi di studio, che dipendono dalle
modalità con cui le persone sono state
selezionate a partecipare al trial o con cui
sono state assegnate ai due gruppi.
• La prevenzione di questo tipo di bias si
realizza con una corretta
randomizzazione.
GIMBE®
Selection bias
• Se uno studio è realmente randomizzato,
l’allocazione ai gruppi non può essere
influenzata dai ricercatori o dai
partecipanti allo studio.
• Con la randomizzazione a tutti i
partecipanti dello studio sono date le
stesse opportunità di essere assegnati ad
ognuno dei gruppi di studio.
GIMBE®
Ascertainment bias
• Questo tipo di bias accade quando i risultati o le
conclusioni di un trial sono distorte dalla conoscenza
relativa all’intervento a cui ogni partecipante è stato
sottoposto. Questo tipo di errore può essere
determinato da chi eroga l’intervento, dai soggetti
studiati, dai ricercatori che valutano o analizzano gli
outcome e coloro che scrivono l’articolo descrivente
il trial.
• Il miglior modo per evitare questo bias è mantenere
le persone, coinvolte nello studio, non a conoscenza
dell’identità dell’intervento il più a lungo possibile,
realizzando quella che è detta CECITA’.
GIMBE®
Bias di violazione di protocollo
Lost to follow‐up Withdrawal Crossover
GIMBE®
(persi al follow up)
(sospendono il trattamento)
(passano all’altro gruppo)
Tutti i pazienti randomizzati devono essere inclusi nell’analisi alla fine dello studio (sia i persi al follow‐up, sia quelli che hanno sospeso il trattamento)
Ciascun paziente deve essere analizzato nel gruppo originale di randomizzazione, anche se poi ha ricevuto il trattamento dell’altro braccio (cross‐over)
GIMBE®
• Ciò si giustifica con il fatto che, se il
trattamento procura effetti indesiderati tali
da non consentire al soggetto di seguire
correttamente la terapia, considerare il suo
esito negativo nel gruppo di controllo, ne
accrescerebbe l’efficacia.
• Questo tipo di elaborazione dei dati si
chiama analisi secondo “intention to treat”.
GIMBE®
Follow up
• Ogni paziente reclutato dovrebbe essere
considerato fino alla fine, ma purtroppo
non è sempre così, a causa, ad esempio
del ritiro del pz dallo studio.
• L’abbandono degli studi non avviene per
caso. Addirittura esso potrebbe essere
determinato dal trattamento stesso.
• Per questo quando il numero dei persi al
follow up non è uguale nei due gruppi, è
legittimo nutrire qualche sospetto.
GIMBE®
Follow up
• Dato che studiare solo coloro che sono rimasti
nello studio può portare a sovrastimare
l’efficacia del trattamento, gli autori dello studio
devono riportare nelle conclusioni tutti coloro
che erano stati reclutati, assegnando ai
soggetti persi del gruppo di trattamento l’esito
peggiore ed ai soggetti persi del gruppo di
controllo l’esito migliore.
• In questo caso il rischio è di sottostimare
l’efficacia del trattamento, il che è sicuramente
più accettabile del contrario.
GIMBE®
Follow up
Il limite tollerabile di persi durante il follow up è
del 20%: difficilmente uno studio che ha
dimostrato l’efficacia del trattamento, rimane
tale attribuendo un esito negativo a più del
20% dei pazienti.
GIMBE®
Publication bias
• Alcune evidenze dimostrano la probabilità
di trovare pubblicato uno studio, non è
casuale ma dipende dalla direzione e dalla
forza dei risultati della ricerca, per cui i
lavori con risultati statisticamente
significativi (positivi) sono pubblicati con
maggiore facilità rispetto a quelli con
risultati non significativi.
• Questa tendenza, che appare favorire i
trials con risultati positivi, è stata chiamata
pubblication bias.
GIMBE®
Publication bias
Uno studio randomizzato sulla lorcainide
(antiaritmico di classe I), dove si osservava un
aumento di morti fra i pazienti trattati, non
venne pubblicato per 13 anni.
Negli anni 80 questi farmaci causarono fra
20.000 e 70.000 morti premature negli USA
Tonks, BMJ dicembre 1999
GIMBE®
Publication bias
Lo studio VIGOR che ha promosso il VIOXX è
stato pubblicato nel novembre 2000 sul New
England Journal of Medicine
Bombardier C. et al. “Comparison of upper gastrointestinal toxicity of rofecoxib and naproxen in patients with
rheumatoid arthritis.” VIGOR Study Group. N Engl J Med. 2000 Nov 23;343(21):1520-8
Nel dicembre 2005 il NEJM informa i lettori
che gli autori dello studio VIGOR avevano
manipolato i dati dei risultati della ricerca
omettendo casi di gravi effetti avversi
Gregory D. et al. “Expression of Concern: Bombardier et al., ‘Comparison of Upper Gastrointestinal Toxicity of
Refecoxib and Naproxen in Patients with Rheumatoid Arthritis’ N Engl J Med 2000 Nov 23;343(21):1520-8”
N Engl J Med. 2005 Dec 29;353(26):2813-4.
GIMBE®
Publication bias
GIMBE®
Publication bias
Dal 1999 al 2004 il VIOXX ha indotto negli
USA circa 38.000 decessi supplementari
(Lancet 2005)
Stima per i paesi industrializzati: 140-160.000
decessi supplementari
GIMBE®
Publication bias
Nel maggio del 2006 il Wall Street Journal
informa che l’editore del NEJM era già a
conoscenza
nell’agosto
2001
della
manipolazione.
La ditta produttrice del VIOXX (Merck) aveva
acquistato nel novembre 2000 per la campagna
pubblicitaria 900.000 ristampe dell’articolo del
NEJM per circa 700-830.000 $
Il profitto per il NEJM si aggirava intorno ai
500.000$
Smith R. “Lapses at the new England journal of medicine.” J R Soc Med. 2006 Aug;99(8):380-2
GIMBE®
Bias temporali – Time lag bias
• Questo tipo di errore accade in quanto la velocità di
pubblicazione dipende dalla direzione e dalla forza
dei risultati del trial. In generale sembra che uno
studio con risultati negativi impieghi il doppio del
tempo per essere pubblicato rispetto ad uno con
risultati positivi.
• Inoltre possono passare anni dal momento in cui lo
studio è stato completato a quello in cui è
pubblicato, tanto che il tempo per realizzalo sembra
uguale a quello necessario per pubblicarlo e ciò
riguarda allo stesso modo gli studi grandi e gli studi
piccoli.
GIMBE®
Possiamo quindi fidarci della generalizzabilità degli RCTs?
In letteratura sono segnalati alcuni aspetti che
devono portarci a considerare gli RCTs
sempre con spirito critico. In particolare
sono segnalati:
– Popolazioni estremanente selezionate
(Gurwitz JH. Jama 1992).
– Competenza, motivazioni e condizioni
organizzativo-assistenziali ideali (Grilli R. Et
al, Il Pensiero Scientifico Editore,1995).
GIMBE®
Possiamo quindi fidarci della generalizzabilità degli RCTs?
– Risultati “medi” che non prevedono la risposta
nel paziente individuale (Rothweel PM.
Lancet 1995).
– End point misurati (surrogati vs clinicamente
significativi) (Fleming TR,et al. Ann.Inter.Med.
1996).
– Misure utilizzate per riportare i risultati
(relative vs assolute) (Bobbio M, et al. Lancet
1994).
GIMBE®
End point
9
Clinicamente rilevanti - comprendono, oltre la mortalità, tutti gli
eventi clinici morbosi che possono essere rilevati nella storia
naturale/post-terapeutica di una malattia
9
Surrogati – sono variabili anatomico-fisiologico-metaboliche
misurate con test di laboratorio/strumentali
9
Hard - di sicura determinazione, per la verifica dei quali l’errore è
minimo (mortalità)
9
Soft - possono essere influenzati da imprecisioni o soggettività
(es. miglioramento di un quadro sintomatologico, lettura di un
ECG/Rx)
GIMBE®
End point
End point surrogato
End point
rilevante
Encainide, Flecainide,
Moricizina: extrasistolia
ventricolare post-IMA (1)
extrasistolia
mortalità
Milrinone: insufficienza
eardiaca (2)
portata circolatoria
mortalità
densità minerale
ossea
incidenza
fratture
Trattamento
NaF: osteoporosi (2)
GIMBE®
(1)
Epstein AE & al. JAMA 1993; 270:2451
(2)
Da: Fleming TR & al Ann Intern Med 1996;125:605
Misure di efficacia negli
RCTs
GIMBE®
Misure di efficacia
Nella lettura degli studi troveremo indicate
diverse misure di efficacia.
La comprensione del loro significato,
l’interpretazione del loro valore e della loro
precisione ci servono per capire la forza e
la direzione della associazione che vi è tra
trattamento e outcome (e quindi di capire
se l’intervento è efficace e in che misura).
GIMBE®
Esempi
o
Cullen l, et al. Nebulized lidocaine decrease the discomfort of nasogastric
tube insertion: a randomized, doubleblind trial. Ann Emerg Med 2004; 44:
131-7.
La somministrazione di lidocaina nebulizzata prima dell’inserimento
del sondino naso gastrico riduce il disagio del paziente?
P: Adulti da sottoporre a posizionamento del SNG in pronto
soccorso.
I: nebulizzazione di lidocaina nella narice da utilizzare.
C: nebulizzazione di soluzione fisiologica.
O: disagio misurato attraverso una scala visuale analogica (misura
continua), difficoltà di inserimento del sondino (misura continua),
complicanze (sanguinamento, vomito, impossibilità di passaggio,
dispnea – misure discrete).
GIMBE®
Esempi
o
Lidocaina
Placebo
Differenza
(IC 95%)
Disagio riferito
(media dei
punteggi della
scala visuale
analogica)
37.7
59.3
26.6 (5.3 a 38.0)
Difficoltà
percepita
dall’infermiere
nell’inserimento
del SNG
2
2
0 (-1 a 1)
Sanguinamento
nasale
17%
0
17% (3,5-31)
Vomito
10%
0
10%(-0,7-21)
Outcome
GIMBE®
Misure di efficacia per gli esisti discreti
Il primo passaggio per capire le diverse misure di
efficacia consiste nel tabellare i dati.
Evento Sì Evento No
GIMBE®
Gruppo S
a
b
a+b
Gruppo C
c
d
c+d
a+c
b+d
Experimental Event Rate
- Frequenza (o rischio)
nel gruppo
sperimentale
a
EER =
a+b
Control Event Rate Frequenza (o rischio) nel
gruppo di controllo
c
CER =
c+d
GIMBE®
La EER ci dice con che
frequenza l’evento
indesiderato
si è verificato
nel gruppo sperimentale
La CER ci dice con che
frequenza l’evento
indesiderato
si è verificato
nel gruppo sperimentale
Rischio Relativo / RR
a
EER
a+b
RR =
=
c
CER
c+d
GIMBE®
Dal rapporto tra EER e CER
individuiamo il rischio relativo
che ci dice qual è il rischio di
manifestare l’evento negativo
nel gruppo sperimentale rispetto
a quello del gruppo di controllo.
Se l’intervento ha un effetto
benefico
RR<1
Se peggiora l’esito RR>1
Se non c’è differenza
RR=1
Odds ratio / OR
a/b
OR =
c/d
GIMBE®
A volte per motivi statistici
anziché essere calcolato il RR
troviamo indicato l’odds ratio
che non confronta le
frequenze
(rischio) degli eventi,
ma le loro probabilità.
se l’intervento ha un effetto
benefico
OR<1
Se peggiora l’esito OR>1
Se non c’è differenza
OR=1
Il difetto di RR e OR
Sia il RR che l’OR sono misure della forza
dell’associazione tra trattamento ed esito
RELATIVE. Esse, cioè, non ci danno
espressione della importanza clinica degli
eventi nei due gruppi.
Per questo motivo, per comprendere la
rilevanza clinica degli effetti, vengono
calcolate altre misure di efficacia.
GIMBE®
Riduzione Assoluta del
Rischio
ARR = CER
_
EER
Riduzione Relativa del
Rischio
ARR
RRR =
CER
GIMBE®
La riduzione del rischio
assoluto ci dà l’esatta
espressione
di quanto si riduca la frequenza
degli eventi negativi con
il trattamento
sperimentato.
La riduzione del rischio
relativo dice con che
proporzione diminuisce
il rischio con
il trattamento sperimentato.
I difetti di RRR e ARR
La RRR è di nuovo una misura relativa,
che non ci dà idea della frequenza, e
quindi dell’importanza, dell’evento
negativo.
Al contrario ARR è proprio espressione di
questa misura, ma essendo un numero
decimale non è facile da ricordare.
Per questo motivo la misura che rende al
meglio l’importanza clinica
dell’associazione tra trattamento ed
outcome è il NNT.
GIMBE®
Number Needed to Treat
1
NNT =
ARR
GIMBE®
Il numero dei pazienti da
trattare ci dice il numero di
quante persone è necessario
trattare per evitare un
evento negativo. Si tratta di
un valore espresso con
numeri interi (e quindi è più
facile da ricordare)
tanto più è piccolo tanto
maggiore è l’efficacia clinica
del trattamento.
Tanto più è grande
tanto più è modesta l’efficacia
del trattamento.
Un esempio reale
Trivedi DP, Doll R, Khaw KT. Effect of four montly oral
vitamin D3 supplementation on fracutres and mortality in
men and women living in the community: randomised
double blind controlled trial. BMJ 2003; 326:469-72.
La somministrazione di un integratore di Vitamina D3 per
via orale riduce l’incidenza di fratture negli anziani (6585) residenti al domicilio?
P: anziani residenti al domicilio.
I: somministrazione di un integratore di Vit D3.
C: somministrazione di un placebo.
O: Incidenza di fratture e mortalità da tutte le cause.
GIMBE®
Gruppo
intervento
Frattura
Non
frattura
N. Tot.
Trattamento
sperimentale
119
1226
1345
Trattamento
controllo
149
1192
1341
Rischio Relativo
EER
0,088
RR =
=
= 0,8
CER
0,11
Riduzione Assoluta del Rischio
ARR= CER - EER= 0,11 – 0,088 = 0,022
Riduzione Relativa del Rischio
ARR
0,022
RRR =
=
= 20%
CER
0,11
GIMBE®
Riduzione relativa del
rischio = di quanto si
riduce il rischio nel gruppo
sperimentale?
Riduzione assoluta del
rischio = differenza (%) fra
controllo e sperimentale
Number Needed to Treat
1
1
NNT =
=
= 44
ARR
0,022
Un esempio reale
HELSINKI HEART STUDY
New England Journal of Medicine 1987;
317:1237-45
RCT a doppio cieco della durata di 5 anni sulla riduzione del rischio di
malattie coronariche con la somministrazione di gemfibrozil.
Pazienti arruolati: 4081
gruppo sperimentale (gemfibrozil) 2051 pz.
gruppo controllo (placebo) 2030 pz.
gruppo sperimentale
eventi cardiaci
morti
56
45
gruppo controllo
eventi cardiaci
morti
84
42
Calcolare RR, ARR, RRR, NNT
GIMBE®
Spesso i risultati degli studi clinici sono presentati in termini di riduzione relativa del rischio (RRR) espressa come percentuale. RRR è un indicatore che, se non completato da ulteriori elementi, può enfatizzare l’efficacia di un trattamento, in particolare può far credere che il risultato sia più
rilevante del reale nella pratica medica.
GIMBE®
Studio A: 10.000 pazienti per gruppo; EER= 0,1 (10%); CER=0,2
(20%); RRR= 0,5 (50%)
Studio B: 10.000 pazienti per gruppo; EER= 0,01 (1%); CER=0,02
(2%); RRR= 0,5 (50%)
Studio C: 10.000 pazienti per gruppo; EER= 0,001 (0,1%); CER=0,002
(0,2%); RRR= 0,5 (50%)
ARR
Studio A: 0,1 (10%); Studio B: 0,01 (1%); Studio C: 0,001 (0,1%)
NNT
Studio A: 10 soggetti; Studio B: 100 soggetti; Studio C: 1000
soggetti
GIMBE®
La curva gaussiana
GIMBE®
Quando valutiamo l’efficacia di un
trattamento, oltre ad osservare il valore
della misura di efficacia, dobbiamo
valutarne la sua precisione.
Ciò si effettua considerando l’intervallo di
confidenza (IC).
GIMBE®
L’intervallo di confidenza (IC) fornisce la
misura della precisione dei risultati di una
ricerca, così da poter trarre deduzioni sulla
popolazione da cui sono tratti quei pazienti.
Per convenzione si è stabilito che la
precisione dell’intervallo di confidenza sia al
95%, lasciando sempre un piccolo - 5% margine di errore dovuto al caso.
GIMBE®
Stima l’intervallo dei valori all’interno del
quale, con una probabilità del 95%,
partendo dal risultato conseguito nel
campione, cade il reale valore della
popolazione studiata.
L’IC si fonda sull’idea che lo stesso studio,
condotto su differenti campioni di pazienti,
non porterebbe a risultati identici, ma questi
sarebbero distribuiti intorno al risultato vero
che resta sconosciuto.
GIMBE®
In particolare dobbiamo verificare:
– Quanto è ampio l’intervallo di confidenza.
– Che i suoi limiti non vadano oltre il valore che
conferma l’ipotesi nulla (assenza di
associazione).
GIMBE®
Tanto più è ampio l’intervallo di confidenza, tanto
meno precisa è la stima dell’associazione tra fattori
di rischio e condizione (e meno dobbiamo fidarci del
risultato dello studio).
L’ampiezza dell’IC è inversamente proporzionale
alla dimensione del campione (se sono stati inclusi
pochi
soggetti
il
risultato
ottenuto
è
necessariamente poco preciso).
GIMBE®
Ipotesi nulla
Coloro che eseguono lo studio mirano a
dimostrare l’esistenza di una relazione tra
il trattamento ed il miglioramento
dell’outcome, cercando di dimostrare
l’inesistenza dell’ipotesi nulla.
L’ipotesi nulla sussiste tutte le volte in cui
non vi è alcuna relazione tra il
miglioramento dell’esito ed il trattamento
studiato.
GIMBE®
Ipotesi nulla
Ipotesi nulla - H0
Ipotesi alternativa - H1
Le differenze osservate
Non esiste una reale
differenza tra le variabili sono troppo grandi per
essere dovute al caso.
studiate: le differenze
osservate sono dovute
la caso.
GIMBE®
Ipotesi nulla
Come faccio a verificare che l’ipotesi nulla
si è verificata o meno?
Attraverso l’esame dei valori delle misure
di efficacia, che non devono esprimere
specifici valori.
GIMBE®
Ipotesi nulla nei parametri discreti
Parametro
Valore nullo
RRR, RR, OR
1
ARR
0
NNT
Infinito
ARR non deve essere 0, perché
significherebbe che il beneficio
ottenuto in termini di riduzione
assoluta del rischio è 0, nel qual caso
NNT diventerebbe un valore tendente
ad infinito.
GIMBE®
Se questo valore
è 1, il rischio o la
probabilità di
incorrere
nell’evento
avverso sono
Uguali, sia che
si faccia il
trattamento o meno
Ipotesi nulla nei parametri continui
Nel caso in cui lo studio misuri parametri
continui, mediante la dimostrazione della
differenza della media, essa non deve
essere 0.
GIMBE®
Oltre a verificare i valori delle misure di
efficacia, e del loro IC, vi è anche un altro
parametro che occorre osservare che ci
dice quale sia la significatività statistica dei
risultati ottenuti.
Si tratta del test di significatività che
attribuisce il valore ad una variabile detta
“p”.
GIMBE®
Il valore di “p” misura la probabilità che la
associazione osservata possa essere
dovuta a fluttuazioni casuali, ossia alla
possibile variabilità tra campioni, e possa
quindi non corrispondere a fenomeni reali.
Quanto deve essere il valore di “p” per
poter escludere l’ipotesi nulla?
GIMBE®
Viene deciso dei ricercatori che indicano il
livello di significatività statistica stabilito.
Il livello di significatività per convenzione è
0,05. Si conclude cioè che una differenza non
è dovuta al caso se “p” è uguale o inferiore a
0,05.
Nel caso i ricercatori vogliano darsi una
maggiore rigorosità statistica possono portare
tale valore soglia a 0,01.
GIMBE®
È chiaro quindi che
– "Significativo" non è sinonimo di "importante“
– "Significativo" = difficilmente dovuto al puro
caso.
GIMBE®
La significatività statistica non è l’unico criterio utile per
decidere se utilizzare i risultati di uno studio. La
significatività statistica dipende dalla numerosità del
campione e dall’entità dell’efficacia del trattamento
(differenze degli esiti). Differenze clinicamente
irrilevanti possono divenire statisticamente significative
se le dimensioni dei campioni sono sufficientemente
grandi. Al contrario, differenze clinicamente rilevanti
possono essere statisticamente non significative se i
campioni sono troppo piccoli (in gergo statistico, se lo
studio manca di potenza statistica).
GIMBE®
Randomized Controlled
Trials - Critical Appraisal
Come si valutano i Trial
Randomizzati Controllati.
GIMBE®
Critical appraisal
Durante la lettura di una pubblicazione si
devono trovare informazioni sufficienti,
presentate chiaramente e coerentemente,
affinché si possa decidere, in base alle
evidenze fornite dagli autori, se le
conclusioni a cui arrivano sono logiche e
credibili.
GIMBE®
Critical appraisal
1. Validità interna
2. Rilevanza clinica
3. Applicabilità
4. Consistenza
GIMBE®
Validità interna
E’ diretta funzione del rigore metodologico seguito dai ricercatori nelle fasi di:
pianificazione e conduzione dello studio
analisi dei risultati
La validità interna ha l’obiettivo di minimizzare i bias che possono “distorcere” la validità interna della ricerca
I criteri di validità interna variano con il disegno dello studio correlato agli obiettivi (eziologia, diagnosi, prognosi, trattamento)
GIMBE®
Validità interna
A. I risultati dello studio sono validi?
Domande preliminari
Lo studio si basa su una domanda di ricerca
chiaramente formulata?
– Una domanda di ricerca è chiaramente formulata se sono
indicati la popolazione studiata, l’intervento erogato,
l’outcome considerato
Gli autori hanno utilizzato il tipo giusto di
studio?
– Deve esserci coerenza tra il disegno dello studio ed il
quesito.
Vale la pena continuare?
GIMBE®
Validità interna
A. I risultati dello studio sono validi?
Domande specifiche
L’assegnazione dei pazienti al trattamento è
randomizzato?
– I migliori metodi di randomizzazione sono
quelli realizzati con il computer o con le
tabelle dei numeri random; sistemi più deboli
sono rappresentati dalla assegnazione
alternata, secondo l’ordine di arrivo o del
giorno, settimana o mese.
Se non c’è stata randomizzazione
si può anche lasciare perdere
l’articolo in esame.
GIMBE®
Validità interna
Tutti i pazienti ammessi a partecipare allo studio
sono stati considerati in modo appropriato nella
formulazione delle conclusioni?
– La percentuale dei pazienti persi al follow up
non può essere superiore al 20%.
– L’analisi dei risultati deve essere effettuata
secondo l’intention to treat.
GIMBE®
Validità interna
Vi era cecità rispetto al trattamento? A che livelli?
– Considera che questo non è sempre
possibile, ma valuta se sono stati compiuti
tutti gli sforzi per assicurare la cecità. In
generale coloro che analizzano i dati
possono sempre non essere resi a
conoscenza a che gruppo si riferiscono i
dati che stanno analizzando.
GIMBE®
Validità interna
I gruppi erano simili all’inizio dello studio?
– Valuta l’eventuale presenza di fattori di
confondimento che possono avere influito
sugli esiti.
Al di là dell’intervento sperimentale i
partecipanti sono stati assistiti nello stesso
modo?
– Ad esempio verifica se sono stati valutati agli
stessi intervalli.
GIMBE®
Critical appraisal
3. Applicabilità
4. Consistenza
GIMBE®
Rilevanza clinica
Misura, oltre la significatività statistica, l’entità e la precisione dei risultati dello studio
Corrisponde alla fase di analisi dei risultati
Gli elementi che influenzano la rilevanza clinica sono diversi per gli studi di:
• trattamento
• diagnosi
• eziologia e prognosi
GIMBE®
Rilevanza clinica
Studi di eziologia e prognosi
‐ Rilevanza clinica del danno o fattore prognostico
‐ RR (studi di coorte), OR (studi caso‐controllo)
‐ Ampiezza limiti di confidenza
Copyright © - GIMBE®
Rilevanza clinica
Studi su trattamenti
‐ Rilevanza clinica dell’end‐point (surrogato vs significativo)
‐ NNT
‐ Ampiezza limiti di confidenza
GIMBE®
Rilevanza clinica
B. Quali sono i risultati?
Quanto è grande l’effetto del trattamento?
– Verificare che i parametri di efficacia non
corrispondano al valore dell’ipotesi nulla.
– Quanto precisa è la stima dell’effetto del
trattamento?
– Valutare gli intervalli di confidenza ed il valore
di “p”.
GIMBE®
Critical appraisal
3. Applicabilità
4. Consistenza
GIMBE®
Applicabilità
E’ il grado con cui i risultati dello studio possono essere applicati al paziente individuale
Sinonimi: validità esterna, generalizzabilità
E’ condizionata da: criteri di selezione dei pazienti: inclusione, esclusione
setting assistenziale: aspetti strutturali‐organizzativi, tecnologie, “competence” professionale
descrizione accurata e completa delle modalità di erogazione degli interventi sanitari
GIMBE®
Applicabilità
C. Il risultato potrà aiutarti a livello locale?
I risultati possono essere applicati alla
popolazione locale?
– Considera se i pazienti considerati nel trial
sono così diversi da quelli da te assistiti.
GIMBE®
Internal validity
(efficacy)
Pragmatic
RCTs
Explanatory
RCTs
Generalizzability
(effectiveness)
Ideal
condition
GIMBE®
Routine
clinical practice
Critical appraisal
3. Applicabilità
4. Consistenza
GIMBE®
Consistenza
Si riferisce al fatto che i risultati di uno studio vengano confermati da altri studi Sinonimi: riproducibilità della ricerca
Lo strumento ideale per verificare la consistenza della ricerca è rappresentato dalle revisioni sistematiche con meta‐analisi di più studi
N.B. Il bias di pubblicazione, oltre a sovrastimare l’efficacia dei trattamenti, influenza la consistenza della ricerca
GIMBE®
Mortality in the placebo and beta-blocker groups of 22 RCTs
GIMBE®
Brophy JM, et al. Ann Intern Med 200
Lo studio è valido?
no
sì
I risultati dello studio sono clinicamente rilevanti?
no
Presenza di bias che compromettono la validità dei risultati
Risultati validi, ma irrilevanti: end points surrogati, NNT elevato, ampi limiti di confidenza
sì
I risultati dello studio sono applicabili?
sì
Best evidence for patients
GIMBE®
no
Risultati validi e clinicamente rilevanti, ma non applicabili nel mio contesto assistenziale
GIMBE
THE END
GIMBE®
®
Gabriele Manzi
GIMBE®
Clinical state, setting
and circumstances
Patient
preferences
Clinical
expertise
Research
evidence
Health care
resources
Modificato da Haynes RB et al ACP J Club. 2002; 136:A11-14.
GIMBE®
“…E’ causa di grande preoccupazione constatare
come la professione medica non abbia saputo
organizzare un sistema in grado di rendere
disponibili e costantemente aggiornate, revisioni
critiche sugli effetti dell’assistenza sanitaria”
Archibald Cochrane, 1972
GIMBE®
BIOMEDICA
RICERCA
PRIMARIA
• Studi
Osservazionali
• Studi
Sperimentali
RICERCA
SECONDARIA
FONTI
TERZIARIE
• Revisioni
Sistematiche
• Linee Guida
• Analisi
economiche
• Revisioni
tradizionali
• Trattati
• Editoriali
• Parere degli
esperti
GIMBE®
Reviews
Hanno lo scopo di rendere più
“maneggevole” una enorme quantità di
informazioni
• revisioni tradizionali
• revisioni sistematiche
GIMBE®
Narrative reviews vs Systematic reviews
Le revisioni tradizionali, oltre ad avere obiettivi
molto ampi, non sono basate su una ricerca
sistematica delle evidenze disponibili, ma su una loro
selezione legata a diversi fattori: reperibilità,
accessibilità linguistica, sintonia con le idee
dell’autore
Nelle revisioni sistematiche esiste un protocollo
che definisce un preciso obiettivo e descrive fonti e
metodi utilizzati per ricercare, selezionare e
sintetizzare gli studi primari.
Cartabellotta A. Rec Prog Med 1998
GIMBE®
Narrative reviews
Paragonabile al capitolo di un libro
Non esplicitato il metodo di selezione,
sintesi e interpretazione delle evidenze
Impossibilità di fare una valutazione
critica
Obiettivi molto ampi
Spesso fatte su commissione
GIMBE®
Systematic reviews
E’ una ricerca scientifica vera e propria
Esplicitato il protocollo metodologico
Possibilità di fare una valutazione critica
(definizione di obiettivi, fonti, criteri…)
Sintesi dei risultati spesso fatta con
metodo meta analitico
GIMBE®
Narrative vs Systematic Review
Caratteristica
Revisione
narrativa
Revisione
sistematica
Domande
Molto vasta
Focalizzate su problemi
clinici
Risorse e ricerca
Non specificate
Complete ed esplicite
Selezione
Non specificate
Basato su criteri espliciti
Critical appraisal
Variabile
Rigoroso
Sistesi
Qualitativa
Qualitativa/quantitativa
*meta-analisis
GIMBE®
Systematic reviews
E’ uno strumento capace di risolvere alcuni
problemi legati alla ricerca e alla
interpretazione delle migliori evidenze
Ha un ruolo sempre crescente per la pratica
clinica,
per la formazione permanente,
per le decisioni di politica sanitaria
GIMBE®
Systematic reviews
Processo di sviluppo delle RS
Nel loro approccio, i ricercatori, usano
metodi che sono pianificati e documentati
in un protocollo di revisione sistematica
Il protocollo di revisione sistematica
assicura che la revisione sia condotta con
lo stesso rigore che ci si aspetta da una
ricerca
GIMBE®
Systematic reviews
Componenti essenziali del processo
Definizione del quesito
Strategia di ricerca
Definizione dei criteri di
inclusione/esclusione
Valutazione qualità degli studi
Estrapolazione dei dati
Analisi dei dati
GIMBE®
GIMBE®
Quesito
E’ un momento fondamentale del processo
di una revisione sistematica
Deve definire i partecipanti, gli interventi da
valutare, gli esiti da misurare
Quesiti poco chiari e troppo generici
portano a revisioni di qualità scadente
GIMBE®
GIMBE®
Definizione dei criteri di
inclusione/esclusione
Devono essere sempre definiti prima di
iniziare la ricerca degli studi in
letteratura
Devono riflettere direttamente il quesito
della ricerca (partecipanti, interventi,
risultati)
Devono indicare che tipo di studi
verranno selezionati
GIMBE®
GIMBE®
Elementi per la strategia di ricerca
Database
Ricerca manuale
Ricerca bibliografica
Contatti con i ricercatori
Ricerca di letteratura non pubblicata
GIMBE®
Elementi per la strategia di ricerca
Lo scopo è di identificare quanti più studi
ragionevolmente possibili sull’argomento;
Anche i lavori non pubblicati devono essere
richiesti per aiutare a minimizzare i rischi di
pubblicazione. Le riviste tendono ad
accettare e pubblicare più facilmente le
ricerche che dimostrano un esito positivo.
Includere, quando è possibile, anche le
pubblicazioni non in inglese
GIMBE®
GIMBE®
Valutazione della qualità degli studi
Una RS di buona qualità dipende soprattutto
dalla qualità degli studi inclusi
La valutazione dello studio deve essere
condotta sulla metodologia della ricerca
utilizzata per condurre lo studio stesso
Esistono numerosi strumenti per effettuare
una valutazione critica
Va definito il metodo usato per la
valutazione e per l’analisi delle informazioni
per farli conoscere ai fruitori della RS
GIMBE®
GIMBE®
Sintesi degli studi
Definizione dei risultati importanti per le
proprie necessità
Estrapolare i dati nel modo più accurato
possibile, tale che siano di facile utilizzo
quando si affronta l’analisi dei dati stessi
Creare uno strumento (es. una griglia)
dove registrare le informazioni che
servono, relative ad ogni studio incluso
nella RS
GIMBE®
GIMBE®
Sintesi dei dati
L’obiettivo di una revisione sistematica è di
riassumere i risultati di differenti studi per
ottenere una valutazione totale dell’efficacia
di un intervento o di un trattamento.
Per un buon livello qualitativo, l’analisi dei
dati va definita nel protocollo della RS.
GIMBE®
Sintesi dei dati
L’obiettivo di una revisione sistematica è di
riassumere i risultati di differenti studi per
ottenere una valutazione totale dell’efficacia
di un intervento o di un trattamento.
Sintesi dei dati
I dati di una revisione si possono analizzare
in modi differenti: se quantitativi si prestano
al metodo statistico della meta-analisi, se
qualitativi possono essere presentati in
forma narrativa
In entrambi i casi è opportuno definire a
priori la metodologia che si intende seguire
GIMBE®
GIMBE®
Systematic reviews critical appraisal
La RS presenta un quesito ben definito?
(partecipanti, interventi, outcomes)
E’ stato fatto un lavoro sostanziale per la
ricerca della letteratura? (database appropriati,
key words adeguate, letteratura non pubblicata)
E’ stata data una corretta e chiara definizione
dei criteri di inclusione/esclusione? Questi sono
adeguati?
GIMBE®
Systematic reviews critical appraisal
E’ stata valutata dagli autori la qualità degli
studi inclusi nella RS?
Gli autori forniscono sufficienti informazioni
sui singoli studi? (progetto di ricerca,
dimensione del campione, descrizione
dell’intervento, risultati ottenuti)
Gli studi sono stati assemblati in modo
appropriato? (test di eterogeneità)
GIMBE®
Meta-analisi
Metodo per combinare i risultati di più studi
indipendenti in modo da poter calcolare un
valore statistico riassuntivo totale (cumula
insieme più studi per ottenere un risultato
unico)
Consente una stima più precisa dell’efficacia di
un trattamento
La validità di una MA dipende anche dalla
qualità della RS su cui si basa
GIMBE®
Systematic review Æ analisi metodologica
Meta – analisi Æ analisi statistica
GIMBE®
***
Revisioni sistematiche
(qualitative)
Meta – analisi
(non sistematiche)
(quantitative)
GIMBE®
Meta-analisi
E’ utilizzata nei casi di incertezza nella valutazione
di efficacia di un trattamento
quando i risultati dei singoli studi non sono univoci
quando singoli studi sono effettuati su un campione di
pazienti numericamente scarso (poco affidabili)
LA COMBINAZIONE DEI DATI DIMINUISCE
L’IMPRECISIONE DEI RISULTATI DEI SINGOLI STUDI
GIMBE®
GIMBE®
Antmann EM, et al. JAMA 1992
GIMBE®
The
Cochrane Library
GIMBE®
Cochrane Collaboration
Nel 1993 nasce la fondazione Cochrane
Collaboration, un network internazionale con
l’obiettico di “preparare, aggiornare e disseminare
revisioni sistematiche degli studi clinici controllati
sugli effetti dell’assistenza sanitaria e, laddove
non disponibili studi clinici controllati, revisioni
sistematiche delle evidenze comunque esistenti.”
Chalmers I., 1993
GIMBE®
Cochrane Library
• Rappresenta la prima fonte di consultazione se il
“bisogno d’informazione” riguarda l’efficacia dei
trattamenti
• Tuttavia:
- il n° totale di revisioni sistematiche (RS) è
ancora modesto (poco più di 1.800 ripetto alle
30-35 mila previste)
- la maggior parte di RS riguarda 5-6 specialità
cliniche
- molte RS non sono conclusive (aree grigie)
GIMBE®
Cochrane Library
Rappresenta la prima fonte di consultazione se
il “bisogno d’informazione” riguarda l’efficacia
dei trattamenti
Nonostante sia la fonte di elezione dispone di
un numero di revisioni sistematiche ancora
limitato (circa 4200 vs 35000 previste)
Molte RS in essa contenute riguardano un
numero esiguo di specialità cliniche
molte RS non sono conclusive (aree grigie)
GIMBE®
Cochrane Database of Systematic Reviews
(CDRS)
È il database principale:
raccoglie le revisioni sistematiche realizzate dai
gruppi di revisione Cochrane (circa 4200)
raccoglie i protocolli di quelle in corso di
realizzazione (circa 1900)
abstracts delle revisioni e titoli dei protocolli sono
disponibili gratuitamente on-line.
GIMBE®
Database of Abstract of Reviews of
Effectiveness - (DARE)
Contiene circa 12.000 abstract di revisioni
sistematiche pubblicate in letteratura e valutate
criticamente a cura del NHS Centre for Review and
Dissemination.
Accessibile gratuitamente on-line
GIMBE®
Cochrane Controlled Trial Register
(CCTR)
E’ un registro creato dopo una ricerca manuale delle
riviste pubblicate dal 1945 ad oggi per identificare
tutti i RCTs pubblicati; include più di 608.000 RCTs
(molti non indicizzati nelle banche dati tradizionali)
GIMBE®
altre risorse
The Cochrane Methodology Register (CMR)
Health Technology Assessment Database
NHS Economic Evaluation Database
GIMBE®
www.evidencebasednursing.it
GIMBE®
Mappa della ricerca
QUESITO
FOREGROUND - PICO
BACKGROUND
Cos’è il TIA o come
si assiste il paziente
con TIA
TESTO o
E-BOOK!
E’ meglio posizionare il
paziente con Braden <13 su
materasso a cessione d’aria
o a pressione alternata?
RS
LG
Studi Primari
ESPERIENZA - RICERCA
GIMBE®
Mappa della ricerca
Studi Primari
QUESITO
E’ un trattamento?
Trial randomizzto e controllato
Ricerca una causa?
Studio di coorte o Caso controllo
Vuole conoscere una prognosi?
Studio di coorte
Vuole valutare un test
diagnostico?
Studio di sensibilità
e specificità
GIMBE®
GIMBE
THE END
GIMBE®
®
Gabriele Manzi
GIMBE®
Cosa sono le evidenze
scientifiche?
Conoscenze che derivano dalle prove di efficacia,
ossia dalle sperimentazioni cliniche.
Esse permettono l’acquisizione di nuove
informazioni partendo dal dato empirico derivante
da esse.
Non è evidenza ciò che è
esplicito, ma ciò che è
dimostrato.
GIMBE®
Obiettivi di un sistema sanitario
impossibilitato a soddisfare tutte le
domande di salute
Garantire un livello omogeneo di assistenza su obiettivi
prioritari di prevenzione, diagnosi, terapia, riabilitazione
Utilizzare criteri scientifici, oggettivi, riproducibili per
decidere come impiegare le risorse economiche.
Cartabellotta, 1998
GIMBE®
Come è arrivata l’EBHC
in Italia?
•PSN 1998-2000
•DLGs 229/99
GIMBE®
Il PSN 1998-2000
Il Piano Sanitario Nazionale 19982000 definisce i livelli di assistenza
alla luce di principi ispiratori del
Servizio Sanitario Nazionale.
GIMBE®
Principi ispiratori del SSN
Principio dell'efficacia e
dell'appropriatezza degli
interventi:
le risorse devono essere indirizzate verso le
prestazioni la cui efficacia é riconosciuta in
base alle evidenze scientifiche e verso i
soggetti che maggiormente ne possono trarre
beneficio.
GIMBE®
Appropriatezza
Appropriatezza clinica
Misura in cui un particolare intervento è sia
efficace sia indicato per la persona che lo
riceve.
- benefici > effetti negativi e disagi
- benefici/danni > con = costi
- è relativa solo alla fase decisionale
GIMBE®
Appropriatezza
Appropriatezza organizzativa
o generica
Misura in cui un intervento viene erogato al
livello più gradito all’utente e/o meno costoso
(ad es. domicilio, ambulatorio, day-hospital,
ricovero)
PSN 1998-2000
appropriatezza = efficacia
GIMBE®
Coordinato dall’Istituto Superiore di Sanità in
collaborazione con l’ASL 6 di Livorno, l'Agenzia di sanità
pubblica della regione Lazio, CeVEAS di Modena
e il Centro Cochrane Italiano.
http://www.snlg-iss.it/
GIMBE®
Linee Guida
Raccomandazioni di comportamento clinico,
prodotte attraverso un metodo sistematico di
revisione della letteratura, che informano tutti gli
attori di un sistema sanitario (professionisti,
manager, utenti) sulle modalità più appropriate
per la gestione di presentazioni cliniche e
malattie.
GIMBE®
Contesto Locale
Insieme delle variabili (strutturali, organizzative,
tecnologiche,
professionali)
in
grado
di
condizionare l’applicazione locale di singole
raccomandazioni delle LG.
GIMBE®
Percorso Assistenziale*
• Strumento finalizzato all’implementazione delle LG
• In relazione al contesto locale il PA:
- adatta le raccomandazioni delle LG
- definisce:
- procedure (how)
- professionisti (who)
- setting (where)
- tempistiche (when)
• In altri termini: PA= LG + CL
* da preferire a “percorso diagnostico-terapeutico”, termine più riduttivo
GIMBE®
Processo (assistenziale)
• Rappresenta l’unità elementare del PA (costituito
da N processi)
• In funzione del numero di strutture e di
professionisti coinvolti nella sua erogazione il
processo può essere:
- molto semplice, talora sino ad identificarsi con
la procedura (vedi)
- estremamente complesso
• L’appropriatezza dei processi è correlata al
grado di aderenza alle raccomandazioni cliniche.
GIMBE®
Procedura (operativa)
• Sequenza di azioni tecnico-operative attraverso cui il
professionista eroga gli interventi assistenziali.
• Rappresenta l’unità elementare del processo (costituito
da N procedure)
• Può essere molto semplice (cateterismo vescicale) o
estremamente
complessa
(chirurgia
dell’aorta
addominale).
• La conformità della procedura dovrebbe essere
definita da standard di competence professionale
(accreditamento?)
• Sinomini: istruzione operativa
GIMBE®
A
B
Percorso assistenziale
Processo
Procedura operativa
GIMBE®
C
D
Protocollo
• Può indicare, indifferentemente, un processo o una
procedura.
• Nella letteratura giuridica il termine protocollo viene
interpretato come “vincolante” (mandatory) per i
professionisti.
• Questo termine, in disuso nella letteratura internazionale,
dovrebbe essere limitato, anche per i potenziali rischi
medico-legali.
GIMBE®
Linee Guida
Basate sull’evidenza
GIMBE®
Non basate sull’evidenza
Le non evidence based guide line erano diffuse
soprattutto qualche anno fa, erano fatte in casa
(senza l’ausilio di revisioni sistematiche) ed avevano
come risultato quello di validare i comportamenti già
in atto.
GIMBE®
Le evidence based guide line si caratterizzano per:
la multidisciplinarietà del gruppo responsabile
della produzione della linea guida
la valutazione sistematica delle prove scientifiche
disponibili quale base per le raccomandazioni
formulate
la classificazione delle raccomandazioni in base
alla qualità delle prove scientifiche che le
sostengono
Grilli et al, Lancet 2000
GIMBE®
Requisiti per lo sviluppo e
l’applicabilità di una EBGL
• Deve dichiarare la qualità delle informazioni
utilizzate
(livello
di
evidenza)
e
evidenza
l’importanza/rilevanza/fattibilità/priorità della loro
implementazione
(forza
delle
raccomandazioni).
raccomandazioni
• Deve essere prodotta attraverso un processo
multidisciplinare (cittadini/pazienti): ciò migliora
la qualità delle linee guida e favorisce la sua
adozione nella pratica.
GIMBE®
• Deve esplicitare le alternative di trattamento e
i loro effetti sugli esiti.
• Deve essere flessibile e adattabile alle
mutevoli condizioni locali.
• Nel produrre una linea guida dovrebbero essere
esplicitati i possibili indicatori di monitoraggio
utili a valutarne l’effettiva applicazione.
GIMBE®
• Una linea guida dovrebbe essere aggiornata
con
regolarità
per
evitare
che
le
raccomandazioni divengano obsolete.
• Una linea guida deve essere chiara,
chiara dotata di
una struttura semplice e di un linguaggio
comprensibile,
esplicitando
in
modo
inequivocabile i punti ritenuti fondamentali e le
aree di incertezza.
GIMBE®
Technology Assessment
Le tecnologie sanitarie sono l’insieme delle risorse
materiali e delle conoscenze indispensabili per
l’erogazione delle prestazioni sanitarie.
La valutazione delle tecnologie sanitarie
(technology
assessment)
consiste
nella
valutazione di benefici, rischi e costi (clinici,
sociali, economici, di sistema) del trasferimento
delle tecnologie sanitarie nella pratica clinica.
GIMBE®
Technology Assessment
Si pone ad un livello di complessità superiore
occupandosi non solo di aspetti clinici, ma anche
organizzativi, gestionali, economici e sociali.
Mira fornire ad amministratori e decisori della
sanità le informazioni necessarie per fare scelte
appropriate sul piano dell’efficienza allocativa.
GIMBE®
GIMBE®
Chi produce linee guida?
• Agenzie internazionali
• Agenzie nazionali
• Agenzie governative
• Agenzie regionali
• Aziende sanitarie
GIMBE®
Perché?
variabilità degli interventi professionali;
esigenza di confronto tra attori diversi con
differenti punti di vista (amministratori, pazienti,
operatori);
crescente complessità ed aumentata insorgenza
di obsolescenza delle tecnologie sanitarie;
aumentate aspettative da parte della popolazione.
GIMBE®
Aspetti legali
Le linee guida riassumono in modo sistematico
le migliori modalità assistenziali, attualmente
riconosciute, relative ad una patologia o
condizione.
Esse vengono assunte come “regole d’arte” e
nelle controversie giudiziarie costituiscono il
criterio per valutare la correttezza tecnica della
condotta del professionista sanitario nella cura
del paziente.
GIMBE®
Aspetti legali
La dottrina medico-legale, pur affermando che
per valutare la condotta professionale dei
professionisti si deve fare riferimento anche a
questi documenti, nella stesura vigente
sottolinea correttamente “il valore relativo” delle
linee guida “per i singoli casi”.
®
®
GIMBE
GIMBE
Aspetti legali
Le linee guida, infatti, fanno riferimento a un
paziente astratto, non a “quel particolare
paziente” che il professionista deve in concreto
curare, con la sua complessità clinica e la sua
specificità patologica.
GIMBE®
Aspetti legali
La diligenza deve essere valutata in relazione
alle circostanze concrete nelle quali il
professionista si è trovato a operare e, quindi, la
giurisprudenza riconosce al sanitario pur sempre
uno spazio di discrezionalità tecnica nella scelta
del metodo di cura da proporre al singolo
paziente, purché fondata correttamente su dati
oggettivi riscontrati durante il processo di cura.
GIMBE®
Aspetti legali
In definitiva il professionista nella sua pratica
clinica deve confrontarsi con il contenuto di una
linea guida che, se correttamente elaborata e
aggiornata, può contenere le “regole dell’arte”
astrattamente
applicabili.
Peraltro
la
responsabilità della scelta del trattamento da
praticare compete al professionista che ha in
cura il paziente e, quindi, spetta al sanitario
verificare se nel caso di specie si deve seguire
l’indicazione contenuta nella linea guida ovvero è
opportuno discostarsene a ragion veduta.
GIMBE®
Aspetti legali
Le linee guida non sono infatti vincolanti per
legge e, quindi, non sussiste alcun obbligo di
automatica applicazione.
Le linee guida, se correttamente interpretate, non
deresponsabilizzano quindi il professionista, ma
rendono invece più evidente il suo dovere di
motivare e documentare le scelte dei
comportamenti di cura.
GIMBE®
Aspetti legali
E’
opportuno
infine
ricordare
che
la
responsabilità
professionale
presuppone
l’accertamento non solo della colpa, ma anche
della sussistenza del rapporto di causalità tra
comportamento del sanitario ed evento dannoso
per il paziente.
GIMBE®
GIMBE®
Livello delle prove
Il livello delle prove (LdP) indica che la
raccomandazione è derivata da studi pianificati
e condotti in modo tale da produrre informazioni
valide e prive di errori sistematici.
GIMBE®
Livello delle prove
Complessivamente, quindi, si può affermare che
l’LdP dipende da:
appropriatezza del disegno di studio utilizzato
(qualità metodologica a priori);
rigore e qualità di conduzione e analisi dello studio
(controllo dell’errore sistematico);
dimensione
dell’effetto
clinico/epidemiologico
misurato (controllo dell’errore casuale);
appropriatezza e rilevanza degli indicatori di esito
utilizzati (rilevanza clinico-epidemiologica).
GIMBE®
Forza delle raccomandazioni
La forza della raccomandazione (FdR) si
riferisce alla probabilità che l’applicazione nella
pratica di una raccomandazione determini un
miglioramento dello stato di salute della
popolazione.
GIMBE®
Livello delle evidenze
GIMBE®
GIMBE®
GIMBE®
Ceveas
GIMBE®
Ceveas
GIMBE®
Come nasce una LG
Più volte sono state prodotte per consenso,
attraverso una riunione di un gruppo di esperti nel
corso di un convegno per discutere e concordare
quale sia il comportamento ottimale da tenere per
affrontare un determinato problema.
GIMBE®
Questo processo presenta molti limiti:
non sempre l’opinione degli esperti riflette lo
stato delle conoscenze mediche!
l’analisi della letteratura scientifica potrebbe
non essere eseguita in modo sistematico!
le conclusioni a cui si arriva possono non
essere valide da un punto di vista scientifico!
GIMBE®
Consenso informale
Gli esperti si confrontano nel corso di una o più
discussioni non strutturate e infine arrivano a un
consenso sulle raccomandazioni.
Questo tipo di consenso viene definito informale in
quanto i criteri di scelta non sono fissati a priori e
le raccomandazioni espresse possono riflettere
l’opinione dei più esperti o di coloro che hanno
maggiore autorevolezza o potere.
E’ il metodo più semplice, pratico e utilizzato, ma è
anche quello scientificamente meno valido.
GIMBE®
Consenso formale
Il confronto in questo caso è più strutturato, in
modo che tutte le posizioni espresse all’interno del
gruppo possano essere rappresentate e discusse.
Il metodo più noto è il Metodo” Delphi”: a tutti i
partecipanti viene inviato un questionario con le
domande principali sulle quali si vuole trovare un
consenso.
I partecipanti rispondono e si arriva via via a
discutere
le
risposte-comportamenti
più
raccomandati.
GIMBE®
Metodo Rand
Ideato dalla Rand Corporation negli anni Ottanta,
prevede gruppi di 10-11 esperti che vengono chiamati
a definire i criteri di utilizzo appropriato di una
procedura o di un trattamento, avendo come
riferimento non il paziente medio, ma specifiche
categorie di pazienti.
Il suo limite è che nonostante si combinino le opinioni
degli esperti con le indicazioni della letteratura, la
valutazione resta sempre soggettiva.
E’ difficile coinvolgere più figure professionali, con
diverso potere.
GIMBE®
Consensus Conference
Prevede momenti di discussione e verifica delle
informazioni disponibili sui diversi aspetti di una
tecnologia sanitaria, in modo da definire lo stato
dell’arte e produrre raccomandazioni per la pratica
clinica.
GIMBE®
NICE
National Institute of Clinical Excellence
http://www.nice.org.uk
Organizzazione creata dal governo inglese
all’interno del suo servizio sanitario nazionale
Valuta l’efficacia degli interventi medici,
rappresentati da farmaci, dispositivi medici,
procedure fisioterapiche e chirurgiche allo scopo
di orientare una corretta utilizzazione delle
risorse da parte del National Health Service
inglese
Produce linee guida cliniche e rapporti di
valutazione tecnologica
GIMBE®
GIMBE
THE END
GIMBE®
®
Stomach
Endoscopic cyanoacrylate injection versus b-blocker
for secondary prophylaxis of gastric variceal bleed:
a randomised controlled trial
Smruti Ranjan Mishra,1 Barjesh Chander Sharma,1 Ashish Kumar,2
Shiv Kumar Sarin1,2
1
Department of
Gastroenterology, G B Pant
Hospital, New Delhi, India
2
Department of Hepatology,
Institute of Liver & Biliary
Sciences (ILBS), New Delhi,
India
Correspondence to
Dr S K Sarin, Department of
Gastroenterology, G B Pant
Hospital, 201 Academic Block,
New Delhi 110002, India;
[email protected]
Revised 1 February 2010
Accepted 7 February 2010
ABSTRACT
Background and aims Bleeding from gastric varices is
often severe and difficult to manage. Endoscopic
injection of gastric varices with cyanoacrylate is effective
in prevention of rebleeding. The efficacy of b-blockers in
secondary prophylaxis of gastric variceal bleed has not
been well studied. A comparison of the efficacy of bblocker treatment and cyanoacrylate injection for the
prevention of gastric variceal rebleeding was carried out.
Methods Patients with gastro-oesophageal varices type
2 (GOV2) with eradicated oesophageal varices or
isolated gastric varices type 1 (IGV1) who had bled from
gastric varices were randomised to cyanoacrylate
injection (n¼33) or b-blocker treatment (n¼34).
Baseline and follow-up upper gastrointestinal endoscopy
and hepatic venous pressure gradient (HVPG)
measurements were performed. Primary end points were
gastric variceal rebleeding or death.
Results The probability of gastric variceal rebleeding
rate in the cyanoacrylate group was significantly lower
than in the b-blocker group (15% vs 55%, p¼0.004) and
the mortality rate was lower (3% vs 25%, p¼0.026)
during a median follow-up of 26 months. The median
baseline and follow-up HVPG in the cyanoacrylate group
were 15 (10e23) and 17 (11e24) mm Hg (p¼0.001)
and for the b-blocker group 14 (11e24) and 13
(8e25) mm Hg (p¼0.003). While no patient showed
reduction of HVPG in the cyanoacrylate group, in the bblocker group 12 of 28 (42%) patients were responders,
of which 5 (41% of responders) bled. On multivariate
analysis, treatment method, portal hypertensive
gastropathy and size of the gastric varix >20 mm
independently correlated with gastric variceal rebleeding.
Gastric variceal rebleeding independently correlated with
mortality.
Conclusions Cyanoacrylate injection is more effective
than b-blocker treatment for the prevention of gastric
variceal rebleeding and improving survival.
Clinical trial number NCT00888784.
INTRODUCTION
Gastric varices (GVs) are less common than oesophageal varices, occurring in w20% of patients
with portal hypertension.1 GVs bleed less
frequently than oesophageal varices1; however
bleeding tends to be more severe, requires more
transfusions and has a higher mortality rate than
oesophageal variceal bleeding.1e3 After control of
acute bleeding, GVs have a high rebleeding rate
(34e89%) and treatment options to prevent
rebleeding are not optimised.2 4 5 The frequency
Gut 2010;59:729e735. doi:10.1136/gut.2009.192039
Significance of this study
What is already known about this subject?
< The efficacy of b-blockers in secondary prophy-
laxis of gastric variceal bleeding has not been
well studied.
< Endoscopic injection of cyanoacrylate has been
proven to be effective in the prevention of
gastric variceal rebleeding.
< b-Blockers have been recommended empirically
for prophylaxis of gastric variceal bleeding
without proof.
What are the new findings?
< The probability of rebleeding from large gastro-
oesophageal varices type 2 (GOV2) or isolated
gastric varices type 1 (IGV1) was significantly
lower in the cyanoacrylate group (15%)
compared with the b-blocker group (55%).
< The majority of patients with such gastric
varices have baseline HVPG >12 mm Hg.
< Cyanoacrylate injection increases the hepatic
venous pressure gradient (HVPG), and b-blocker
decreases HVPG significantly on follow-up.
< Mortality in patients treated with a b-blocker is
higher than with cyanoacrylate injection.
How might it impact on clinical practice in the
foreseeable future?
< Endoscopic injection of cyanoacrylate should be
used to prevent rebleeding from large GOV2 and
IGV1 and improve survival.
< b-Blockers are not effective in prevention of
rebleeding from such gastric varices, although
they decrease HVPG in these patients.
and severity of bleeding from GVs depends on their
location (figure 1). GVs could be associated with
oesophageal varices (gastro-oesophageal varices
type 1 along the lesser curve, or type 2 (GOV2),
along the fundus) or are present in isolation
(isolated gastric varix type 1 (IGV1), in the fundus
or at ectopic sites in the stomach or the first part of
the duodenum (isolated gastric varix type 2)).1
b-Blockers have been reported to reduce the risk
of rebleeding of oesophageal varices by w40% and
risk of death by 20%.6e11 A small open-label trial of
b-blocker and nitrate treatment in patients with
GVs reported no significant benefit for rebleeding or
survival, but there was a trend towards lower
rebleeding.12 Endoscopic injection of cyanoacrylate
729
Stomach
Figure 1 Classification of gastric varices.
has been proven to be effective in the prevention of GV
rebleeding.13e17 However, cyanoacrylate injection is associated
with a risk of cerebral or pulmonary embolism and damage to
the endoscope.18
Results on the long-term efficacy of gastric variceal obturation
with cyanoacrylate injection are variable, with eradication rates
between 50% and 100% and a rebleeding rate of 23e50%, with
the majority occurring in the first year.13 16 19e21 Due to the high
rate of primary haemostasis and lower rebleeding rate, GV
obturation with cyanoacrylate injection is used as the first-line
treatment for bleeding GVs and secondary prophylaxis of GV
bleed.
For patients who have bled from IGV1 or GOV2, N-butylcyanoacrylate, transjugular intrahepatic portosystemic shunt
(TIPS) or b-blockers have been recommended.22 Endoscopic
injection of thrombin has also been tried for secondary
prophylaxis of bleeding from GVs.23e26 b-Blockers have been
recommended empirically for GVs since they have been found to
be quite effective in patients who have bled from oesophageal
varices. However, there is no randomised controlled trial available
comparing b-blocker treatment with endoscopic cyanoacrylate
injection for prevention of rebleeding from GVs. We conducted
a randomised controlled trial to study the efficacy of endoscopic
cyanoacrylate injection versus b-blockers in the prevention of
gastric variceal (GOV2 or IGV1) rebleeding and improvement in
survival. We also investigated the predictors of rebleeding and
mortality in patients who had previously bled from GVs.
Patients and methods
Patients with portal hypertension with upper gastrointestinal
bleed were assessed. All consecutive patients with cirrhosis with
GVs (GOV2 with eradicated oesophageal varices or IGV1)
admitted to our hospital from August 2006 to March 2009 were
enrolled. The study was approved by our institutional review
board, and written informed consent was obtained from all
patients. Parents gave the written consent for children. Patients
who had bled from GOV2 or IGV1 were included. The exclusion
criteria were non-cirrhotic portal hypertension, presence of
730
oesophageal varices, contraindications to b-blockers and cyanoacrylate injection, patients already on b-blockers, past history of
cyanoacrylate injection or sclerotherapy for GVs or GV ligation,
TIPS, balloon-occluded retrograde transvenous obliteration
(BRTO), balloon-occluded endoscopic injection sclerotherapy,
shunt operation, undetermined origin of bleeding, hepatic
encephalopathy grade III/IV, hepatorenal syndrome, hepatocellular carcinoma, presence of deep jaundice (serum bilirubin
>10 mg/dl), cardiorespiratory failure, age >75 years, pregnancy
or patients who did not give informed consent.
After assessing eligibility with initial endoscopy and subsequently obtaining informed consent patients were randomised,
using a sealed envelope technique, to receive cyanoacrylate
injection (Nectacryl, Nectar Laboratories, Medak, India) or bblocker (propranolol). The allocation sequence remained with
the statistician and the sequence remained concealed from the
investigators until the intervention was assigned. The doctors
who evaluated the outcomes were blinded to group assignment.
According to the Baveno IV consensus on portal hypertension,22
secondary prophylaxis starts from the sixth day of acute variceal
bleed. For patients presenting with acute GV bleeding, an
emergency endoscopy was done and, if the bleeding was found
to be from GOV2 or IGV1, one cyanoacrylate injection was
given to control acute GV haemorrhage (not with the intention
of obliterating the GV). The patients were randomised and the
follow-up started from day 6 after occurrence of acute bleeding.
None of the patients presenting with acute GV bleeding received
b-blocker or endoscopic treatment for secondary prophylaxis
from day 0 till day 6. Patients with a previous bleed were
randomised after control endoscopy. For patients randomised to
the propranolol group, control endoscopy was completed and
propranolol was started on the same day. For patients randomised to the cyanoacrylate group, a second endoscopy with
cyanoacrylate injection was performed on the same day. In
patients with a history of past bleeding, the follow-up started
immediately after randomisation. The follow-up period was
terminated at 3 months after enrolment of the last patient.
Upper gastrointestinal endoscopy was done to determine the
variceal status at baseline and repeated at 3 month intervals or
during rebleeding. The size of the GV, recurrence of oesophageal
varices and portal hypertensive gastropathy (PHG) were
recorded. An attempt was made to localise the source of bleeding
at the time of emergency endoscopy. The size of the GV was
calculated by a zebra guide wire with markings made at 5 mm
intervals, and we classified the fundal varix as small (<10 mm),
medium (10e20 mm) or large (>20 mm). The colour of the GV
was observed as red or blue. The recent signs of haemorrhage
over the GV were the presence of a clot or blackish spot localised
over the GV.1 The presence or absence of oesophageal varices and
PHG were also evaluated. Concurrence of two independent
observers was required before characterising the type of GV and
its size. Rebleeding from GVs was managed with a cyanoacrylate injection in the b-blocker group and TIPS or surgery in the
cyanoacrylate group.
The primary end points of the study were rebleeding from
GVs or death. The actuarial probability of rebleeding from GVs
and mortality were calculated. Secondary end points were
increase or decrease in the size of GVs, appearance of new
oesophageal varices and appearance or worsening of PHG and
complications.
The diagnosis of cirrhosis was based on clinical, biochemical,
radiological and histological findings. Ultrasonography of the
abdomen with Doppler of splenoportal axis, CT angiography or
magnetic resonance (MR) angiography was done to diagnose
Gut 2010;59:729e735. doi:10.1136/gut.2009.192039
Stomach
cirrhosis and assess spontaneous shunts. Model for end-stage
liver disease (MELD) and ChildeTurcotteePugh (CTP) scores
were calculated at baseline and serial intervals.
Technique of cyanoacrylate injection
Endoscopic injection was performed intravariceally using
a therapeutic endoscope and a transparent Teflon injector, with
a 6e8 mm long, 21-gauge needle, with N-butyl-2-cyanoacrylate.
The cyanoacrylate was injected without dilution with lipiodol.
The maximum amount of cyanoacrylate injected per site was
2 ml, and the total amount of cyanoacrylate required to obliterate the GVs completely was recorded. An attempt was made
to obturate the GVs completely at one session by injecting
cyanoacrylate at multiple sites. If a second session was required,
it was done within a week of the initial session. The injected GV
was palpated using the hub of the injector with the needle
retracted to look for solidification and obliteration of the GV. If
the GV was not completely obturated, cyanoacrylate was reinjected till the whole GV became solidified.
b-Blocker treatment
Propranolol was started at a dose of 20 mg twice daily. The dose
was increased every alternate day by 20 mg to achieve target
a heart rate of 55/min or to the maximal dose of 360 mg/day if
the medication was well tolerated and the systolic blood pressure was >90 mm Hg. On the occurrence of intolerable adverse
effects, systolic blood pressure <90 mm Hg or pulse rate <55/
min, the dose of the medication was decreased step-wise, and
eventually stopped if these adverse events were disabling and
persisted. Reintroduction of the medication was attempted if
cessation of the drug resulted in improvement of the reported
side effect. Compliance was assessed by pill count, and monitoring the heart rate and blood pressure. Patients were monitored every day until adequate b-blockade was achieved, then
every month for the first 3 months and subsequently every
3 months. Patients were advised to refrain from consuming
alcohol and from taking non-steroidal anti-inflammatory drugs.
RESULTS
A total of 1005 consecutive patients with portal hypertension
and upper gastrointestinal bleeding presented to our hospital
between August 2006 and March 2009 (figure 2). Inclusion
criteria were not fulfilled in 938 patients. Sixty-seven (6.6%)
patients with cirrhosis who were enrolled in the study had GVs
(GOV2/IGV1) without oesophageal varices with a history of GV
bleed, and were randomised to receive either cyanoacrylate
injection (n¼33) or b-blocker (n¼34). One patient in the
cyanoacrylate group and two patients in the b-blocker group
refused the allocated intervention, so 32 patients with cirrhosis
received the allocated treatment in each group. One patient who
was lost to follow-up in the b-blocker group was included in the
final analysis.
Of the 64 patients, 15 presented with active bleeding from
GVs and were treated with a combination of vasoactive drugs
along with an emergency single injection of cyanoacrylate after
endoscopic confirmation of the source of bleed from GVs as
revealed by active ooze of blood (n¼6), spurt (n¼5) or presence
of a clot (n¼4). Of 15 patients presenting with acute gastric
variceal bleed, eight patients were randomised to the cyanoacrylate group and seven to the b-blocker group. Of the 64
patients, 49 had a history of bleed from GVs within a 15 day
period from the time of enrolment. These patients had undergone diagnostic endoscopy at the time of acute bleeding at
peripheral hospitals which confirmed bleeding from GVs as
shown by evidence of spurt (n¼7), ooze (n¼17) or presence of
a clot (n¼14) on GVs, and only large GVs (n¼11) without any
other obvious source of bleeding, such as oesophageal varices,
portal hypertensive gastropathy, gastric antral vascular ectasia,
gastric ulcer, gastric erosions and duodenal ulcer. However, these
patients did not receive endoscopic injection of cyanoacrylate.
They were managed with blood transfusions or infusion of fresh
frozen plasma, and some patients received vasoactive drugs.
Bleeding stopped in all the patients without endoscopic
HVPG measurement
After an overnight fast, HVPG measurement was performed as
previously described.27 HVPG was measured and repeated after
1 year in both groups. In patients who presented with GV
bleeding, HVPG was measured within 24 h. Patients were categorised into responders ($20% reduction in HVPG or HVPG
<12 mm Hg) and non-responders (<20% reduction in HVPG),
and the primary end points were calculated.
Statistical analyses
The rebleeding rates following cyanoacrylate injection have been
reported to be in the range of 15e25%.13e18 We hypothesised
that the rate of GV rebleeding would be more in the b-blocker
group (50%) in comparison with the cyanoacrylate group (15%).
Thus, to achieve a statistical power of 80% with an error of 5%
and drop-out rate of 20%, a sample size of 32 patients in each
group was calculated.
Fisher exact test was used to compare categorical groups.
Student t test and ManneWhitney U test were used for the
parametric and non-parametric data, respectively. For comparison of pretreatment and post-treatment parameters, paired
Student t test was used. The KaplaneMeier analyses and logrank test were applied to compare the cumulative probability of
rebleeding from the GV, and death. Each patient was analysed in
an intention to treat manner. A p value of <0.05 was considered
significant. Statistical analysis was done using SPSS, version 15.0
(SPSS, Chicago, Illinois, USA).
Gut 2010;59:729e735. doi:10.1136/gut.2009.192039
Figure 2
Trial profile.
731
Stomach
treatment. Upper gastrointestinal endoscopy performed after
a median period of 7 (4e15) days showed maroon-black spots
(n¼26), a nipple (n¼9) on GVs and only large GVs (n¼14)
without other obvious source of bleeding.
Baseline characteristics of both groups of patients were
comparable (table 1). The median follow-up period was 26
(range 3e34) months in both the groups. The most common
aetiology of cirrhosis was alcohol or cryptogenic (table 1). Four
patients in the b-blocker and six patients in the cyanoacrylate
group had IGV1; the rest of the patients had GOV2. The size,
colour of GVs and PHG between the two groups were similar
(table 1). MR portovenography and Doppler ultrasound
demonstrated spontaneous lienorenal shunt in the cyanoacrylate
group (n¼8) and in the b-blocker group (n¼11).
Gastric variceal obturation
Endoscopic intervention could be performed with 100% technical success. Complete obturation of GVs was achieved in all.
The mean number of sessions required to achieve obliteration
was 1.760.4 (range 1e2). The mean volume of cyanoacrylate
required was 6.061.7 (range 3e9) ml. Mean time to obturation
was 2.060.6 (range 1e3) days.
b-Blocker treatment
All patients were compliant to b-blocker treatment. The mean
daily dose was 160 (range 80e320) mg. The median time
interval to achieve the desired heart rate was 15 (range 5e20)
days, and during this period no patient bled from GVs.
the b-blocker group (55%) (p¼0.004, log rank 8.466, figure 3A).
The difference was also significant in the frequency of bleeding
over a median follow-up period of 26 months from GOV2
between the two groups (p¼0.005) (table 2). Of 34 patients in
the b-blocker group 15 had rebleed, and cyanoacrylate injection
was used to control rebleeding in 13 patients. In two patients,
bleeding from GVs was torrential and cyanoacrylate injection
could not be performed, and they were subjected to surgery. Of
33 patients in the cyanoacrylate group, two patients had
a rebleed and were managed by surgery.
Size of gastric varix
Over a median follow-up period of 26 (3e34) months in the
cyanoacrylate group all patients had a decrease in GV size
(table 2) and the median size of GVs decreased from w25
(10e35) to w5 (0e10) mm (p<0.01). In the b-blocker group, 15
of 34 (44%) patients had an increase in size of GVs and the size
of GVs increased from a median of 25 (10e35) to 30 (15e40)
mm (p<0.01). All patients bleeding from GVs in the b-blocker
group had an increase in size of at least 5 mm.
Appearance of oesophageal varices and PHG
Over a median follow-up period of 26 (3e34) months in the
cyanoacrylate group, 7 of 33 (21%) and in b-blocker group 4 of
34 (11%) patients developed oesophageal varices on follow-up
(p¼0.302). All patients had small oesophageal varices and did
not bleed. No significant difference was noted between the
cyanoacrylate group (24%) and the b-blocker (21%) group
regarding appearance or worsening of PHG (table 2).
Gastric variceal rebleeding
Over a median follow-up period of 26 (3e34) months, in the
cyanoacrylate group 3 (9%) (GOV2 (n¼2), IGV1 (n¼1)) patients
bled, in contrast to 15 (44%) (GOV2 (n¼12), IGV1 (n¼3)) in the
b-blocker group. The probability of GV rebleeding was significantly lower in the cyanoacrylate group (15%) compared with
Hepatic venous pressure gradient
Baseline HVPG was measured in 27 patients in the cyanoacrylate group (three patients refused and, in another two patients,
the hepatic vein could not be cannulated) and in 28 patients in
the b-blocker group (three patients refused and in another one
Table 1 Baseline characteristics of study group patients
Parameters
Cyanoacrylate (n[33)
b-Blocker (n[34)
p Value
Age (median, range) years
Sex (male:female)
Aetiology of cirrhosis (alcohol/
cryptogenic/viral and others)
Previous gastric variceal bleed
Ascites
Hepatic encephalopathy (grade I/II)
Serum bilirubin (median, range) mg/dl
Serum albumin (median, range) g/dl
INR (median, range)
Platelets (median, range) 105/ml
Serum creatinine (median, range) mg/dl
ChildePugh score (median, range)
ChildePugh class (A/B/C)
MELD score (median, range)
HVPG (median, range) mm Hg
Time interval between index bleed and
randomisation (median, range) days
Follow-up (median, range) months
Gastro-oesophageal varix type2
Isolated gastric varix type 1
Size of gastric varix (median, range) mm
Portal hypertensive gastropathy
Colour of gastric varix (red/blue)
Spontaneous shunts
40 (6e70)
19:14
12/14/7
39 (6e70)
26:8
11/15/8
0.702
0.109
0.779
4/33
15/33
3/33
1.6 (0.4e6.9)
2.8 (1.2e3.7)
1.6 (1.1e3.2)
1.1 (0.4e2.1)
0.7 (0.2e1.5)
9 (6e12)
4/12/17
15 (8e35)
15 (10e23)
7 (4e15)
3/34
15/34
3/34
1.6 (0.3e5.9)
2.9 (2e3.7)
1.5 (1.1e3.1)
1.1 (0.5e3.2)
0.7 (0.3e1.5)
9 (6e12)
5/13/16
15 (8e35)
14 (11e24)
7 (4e14)
1
1
1
0.756
0.782
0.812
0.695
0.818
0.753
0.970
0.360
0.894
1
26 (3e34)
26/33
6/33
25 (10e35)
12/33
25/8
11/33
26 (3e34)
28/34
4/34
25 (10e35)
17/34
23/11
10/34
0.947
0.495
0.495
0.202
0.213
0.572
0.791
HVPG, hepatic venous pressure gradient; INR, international normalised ratio; MELD, model for end-stage liver disease.
732
Gut 2010;59:729e735. doi:10.1136/gut.2009.192039
Stomach
HVPG on follow-up in the b-blocker group was significant
(p¼0.003). In the cyanoacrylate group, 22 of 27 (81%) patients
had HVPG >12 mm Hg at baseline and 22 of 23 (91%) patients
had HVPG >12 mm Hg on follow-up. In the b-blocker group, 24
of 28 (85%) patients had HVPG >12 mm Hg at baseline and 18
of 28 (64%) patients had HVPG >12 mm Hg on follow-up.
Response to treatment and rebleeding
Figure 3 Probability of freedom from gastric variceal rebleeding (A)
and survival (B) in the cyanoacrylate and b-blocker group.
patient the hepatic vein could not be cannulated). Follow-up
HVPG was measured in 23 patients in the cyanoacrylate group
(three patients refused HVPG measurement and in one patient
the hepatic vein could not be cannulated) and in 28 patients in
the b-blocker group. Median baseline HVPG was 15 (10e23)
mm Hg and follow-up HVPG was 17 (11e24) mm Hg in the
cyanoacrylate group. The increase in HVPG on follow-up in the
cyanoacrylate group was significant (p¼0.001). The median
baseline HVPG was 14 (11e24) mm Hg and follow-up HVPG
was 13 (8e25) mm Hg in the b-blocker group. The decrease in
In the cyanoacrylate group, none of the patients responded to
treatment in the form of reduction in HVPG. In the b-blocker
group, 12 of the 28 (42%) patients were responders. However, 5
(41%) responders bled during follow-up. Of the non-responders,
2 of 23 (8%) patients in the cyanoacrylate group and 8 of 16
(50%) patients in the b-blocker group bled. In responders, the
median baseline HVPG was 16 (12e22) mm Hg and the followup HVPG was 12 (8e16) mm Hg. A >20% reduction in HVPG
was achieved in 12 of 28 (42%) patients in the b-blocker group.
In non-responders in the b-blocker group, the median baseline
HVPG was 14 (11e24) mm Hg and the follow-up HVPG was
13.5 (10e25) mm Hg. Of the responders who bled, one had
IGV1 and the other four had GOV2. Those responders who did
not bleed had a mean decrease in HVPG from baseline of 30% in
comparison with a 22% decrease in those who bled. In patients
with IGV1, the baseline median HVPG was 13 (11e17) mm Hg
and in patients with GOV2 the baseline median HVPG was 14
(10e24) mm Hg.
Since all the patients were bleeders, it was interesting to note
that patients with spontaneous shunt (n¼21) at baseline had
a lower median HVPG (13 (10e15) mm Hg) than patients
without spontaneous shunt (n¼43) (18 (13e24) mm Hg
(p¼0.001)). Thus, the HVPG levels which predict the likelihood
of rebleeding in patients with or without shunt are 13 and
18 mm Hg, respectively. In patients with spontaneous shunt, 6
of 21 (28%) patients had a HVPG value of <12 mm Hg. In
patients without spontaneous shunt, all had HVPG >12 mm
Hg. Overall, 20% of patients who bled had a HVPG <12 mm Hg.
Patients with cirrhosis who bled had a median HVPG of 15
(11e24) mm Hg at baseline and 14 (9e25) mm Hg (p¼0.715) at
follow-up. In non-bleeders, the median HVPG at baseline was 14
(10e23) mm Hg and follow-up HVPG was 13.5 (8e22) mm Hg
(p¼0.031). HVPG decreased in 5 of 10 (50%) patients in the
b-blocker group with spontaneous shunt.
Complications
No significant difference in the frequency of complications was
observed between the two groups. One (3%) patient in the
Table 2 Results of treatment in the cyanoacrylate and b-blocker groups
b-Blocker group (n[34)
Cyanoacrylate group (n[33)
Characteristics
(n, %)
(n, %)
p Value
Follow-up (months)
Rebleeding
Bleed from GOV2
Bleed from IGV1
Increase in size of GV
Decrease in size of GV
Appearance of oesophageal varix
Aggravation/appearance of PHG
Bleed-related mortality
Overall mortality
GOV2
IGV1
Complications
26 (3e34)
3/33 (10%)
2/26 (8%)
1/6 (16%)
0
33/33 (100%)
7/33 (21%)
8/33 (24%)
0/33
1/33 (3%)
0/26
1/6 (16%)
1 (3%)
26 (3e34)
15/34 (44%)
12/28 (36%)
3/4 (75%)
15/34 (44%)
0
4/34 (11%)
7/34 (21%)
6/34 (17%)
8/34 (23%)
6/28 (21%)
2/4 (50%)
2 (6%)
0.004
0.005
0.190
0.001
0.001
0.302
1
0.024
0.046
0.023
0.5
1
GOV2, gastro-oesophageal varix type 2; GV, gastric varix; IGV1, isolated gastric varix type 1; PHG, portal hypertensive gastropathy.
Gut 2010;59:729e735. doi:10.1136/gut.2009.192039
733
Stomach
cyanoacrylate and 2 (6%) patients in the b-blocker group
developed spontaneous bacterial peritonitis, and improved on
treatment (table 2). Four patients in the b-blocker group developed headache and generalised weakness which was controlled
with dose reduction, and after 2e3 weeks the desired dose was
achieved. No serious adverse effects were noticed which required
stoppage of b-blocker.
Mortality
Over a median follow-up period of 26 (3e34) months, a significant
difference in the overall mortality was observed between the
cyanoacrylate (n¼1, 3%) and the b-blocker (n¼8, 25%) group
(p¼0.016, log rank¼5.808, figure 3B). Of 15 patients with
rebleeding in the b-blocker group, treated with cyanoacrylate
injection, 6 (19%) patients died in comparison with none in three
of 33 patients in the cyanoacrylate group (p¼0.024) who were
managed with surgery at the time of rebleeding. In the b-blocker
group, one patient died of hepatic encephalopathy and one patient
died of spontaneous bacterial peritonitis and hepatorenal
syndrome. In the cyanoacrylate group, one patient died of hepatic
encephalopathy. The cumulative 2-year survival rate was 90% in
the cyanoacrylate group and 52% in the b-blocker group. In the
b-blocker group, only one patient died among responders
compared with seven patients among non-responders (p¼0.077).
Predictors of GV rebleed and mortality
On univariate analysis, the variables which influenced GV
rebleeding were size of the GV >20 mm, a red or black spot over
the GV, blue colour of the GV, presence of PHG, international
normalised ratio (INR) $1.6, serum creatinine >1 mg/dl, CTP
score $9, MELD score $17 and the treatment method. On
multivariate analysis, the treatment method (p¼0.006), PHG
(p¼0.029) and size of the GV >20 mm (p¼0.002) independently
correlated with GV rebleeding. Variables which influence
survival on univariate analysis included GV rebleeding, presence
of PHG, CTP $9, MELD score $17 and the treatment method.
On multivariate analysis, GV rebleeding (p¼0.003) independently correlated with mortality.
DISCUSSION
The results of this large prospective randomised controlled
clinical trial comparing the efficacy of cyanoacrylate and
b-blocker treatmjent clearly demonstrate that endoscopic injection of cyanoacrylate is significantly more effective than bblocker treatment for the prevention of rebleeding from GVs.
Small studies have reported that drug treatment is effective in
the prevention of GV rebleeding after successful endoscopic
injection of the bleeding varix with tissue adhesives.12 28 A small
open-label trial of b-blocker and nitrate treatment reported no
significant benefit in terms of risk of rebleeding or survival.12
This study had several limitations such as a small sample size,
retrospective analysis, fixed and low doses of drug and lack of
haemodynamic monitoring. In another study, Evrard et al
randomised patients to receive 160 mg/day of propranolol
(n¼20) or histoacryl injection (n¼21) for secondary prophylaxis
of gastro-oesophageal variceal bleed.28 The rebleeding and
mortality rates were similar in the b-blocker and endoscopic
treatment groups, with a higher complication rate in the latter
(10% vs 47%). This study had limitations of early termination
due to increased complications in the histoacryl arm, no
haemodynamic monitoring and inclusion of patients with coexisting oesophageal varices. In fact, all patients had oesophageal
varices, and 17 had GVs (12 with GOV1 and 5 with GOV2).
734
The median baseline HVPG was 14.5 (10e24) mm Hg. Of 55
patients with bleeding from gastric varices, 16% had baseline
HVPG <12 mm Hg. Patients with gastric variceal bleed have
been shown to have lower HVPG than patients with bleeding
from oesophageal varices.29 In addition, w35% of patients with
gastric variceal bleed have portal pressure <12 mm Hg.29
Spontaneous splenorenal or gastrorenal shunts commonly
develop between the splenic vein (splenorenal shunt) and gastric
varices, respectively, and connect via the inferior phrenic or
suprarenal vein to the left renal vein.30 These shunts are seen in
up to 60% of patients with GVs.30 31 We found spontaneous
shunts in 33% of our patients. Median HVPG in patients with
spontaneous shunts was lower in comparison with those
without shunts (13 vs 18 mm Hg). In fact, HVPG was high in
patients with both GOV2 and IGV1 types of varices. Of the 15
patients who bled, 20% had HVPG of <12 mm Hg.
There are limited data on the efficacy of b-blockers in reduction of portal pressure in patients with GVs. Keeping this in
view, HVPG was measured to assess the response to b-blocker
treatment in all our patients with GVs. The median HVPG was
significantly reduced on follow-up; however, this could not
prevent GV rebleeding or mortality. Of the 12 responders to bblocker, five patients who rebled had a median decrease in HVPG
of 22% compared with 30% in seven patients who did not
rebleed. This raises the question of whether a 20% reduction in
HVPG is adequate to define a response to b-blocker in patients
with GVs, or whether it should be a >25% reduction. In the
cyanoacrylate group, HVPG increased on follow-up, however
without an increase in the frequency of rebleeding. This increase
in HVPG could be attributed to redistribution of blood flow in
the portal system and indirectly suggests adequacy of GV
obturation. Whether addition of a b-blocker to prevent the rise
in HVPG by cyanoacrylate injection will be helpful needs further
studies.
BRTO and balloon-occluded endoscopic injection sclerotherapy have shown good results for prophylaxis of high risk
fundal variceal bleed.31 32 In a prospective trial, Hong et al showed
that the therapeutic efficacies of glue and BRTO for the treatment
of active gastric variceal haemorrhage and/or high-risk GV
appeared to be similar. However, glue might be associated with
a higher rebleeding rate than BRTO, and BRTO could be an
effective rescue treatment for patients with gastric variceal
haemorrhage after initial treatment with glue.32 TIPS has been
found to be quite effective in the prevention of GV rebleeding, not
only by reducing the portal pressure, but also by allowing the
possibility of obliterating gastric variceal channels. In a randomised controlled trial (patients with both oesophageal varices and
GVs), Lo et al showed that TIPS was more effective than glue
injection in preventing rebleeding from GVs, with similar survival
and frequency of complications.33 However, all these procedures
are quite expensive, need expertise and are associated with
complications over time.18 29 32e35 Cyanoacrylate injection, on
the other hand, is easily available, rapid, cost-effective, requires
less expertise, is less invasive and has been shown to be very
effective in preventing GV rebleeding.13 16 19e21 34
Endoscopic variceal sclerotherapy is inappropriate for patients
with fundal varices.5 19 The randomised trials by Tan et al and Lo
et al compared histoacryl injection and variceal band ligation and
reported a rebleeding rate almost twice higher with ligation for
secondary prevention of bleeding from GVs.13 36
Generally not more than 2 ml of cyanoacrylate should be
injected at a single site to prevent embolic and thrombotic
complications.37 38 Precautions to prevent damage to the endoscope must be taken during cyanoacrylate injection. By avoiding
Gut 2010;59:729e735. doi:10.1136/gut.2009.192039
Stomach
lipiodol use, limiting the total volume per injection and by
proper monitoring, no embolic complication was observed in our
study population. Dilution of cyanoacrylate with lipiodol delays
solidification by 15e20 s, thus increasing the chances of distal
embolisation. In the undiluted form, it solidifies within 5e7 s,
and thus the chances of distal embolisation become almost
negligible. However, with undiluted cyanoacrylate injection,
there is a risk of needle impaction in GVs at the time of
injection.39
b-Blocker treatment was well tolerated and none of our
patients required withdrawal of treatment. In the cyanoacrylate
group, 21% of patients had appearance of new oesophageal
varices in comparison with 11% in the b-blocker group, although
none of the patients bled from oesophageal varices. The
frequency of PHG in both the groups was similar (24% vs 21%).
Survival in the cyanoacrylate group was better because of
a low frequency of rebleeding compared with the b-blocker
group. In both the groups, differences in mortality were due to
rebleed-related deaths. However, mortality due to other causes
(not rebleed related) was similar in both the groups. More
favourable survival in our patients could be due to close followup during the study period and the fact that w50% of the
patients with cirrhosis in both the groups were Child A and B.
The stronger aspects of our trial are that it was a randomised
controlled study and had a large sample size, a longer follow-up
period and a minimal drop-out rate. The weaker aspect of this
study is inclusion of only a selective patient group with IGV1
and GOV2 types of GVs. The applicability of the present study
is related only to patients with bleeding GVs of GOV2 and IGV1
types. GVs of GOV2 and IGV1 types are more common, and
associated with more frequent and more severe bleeds with
higher mortality rates as compared with other types of gastric
varices.1
In conclusion, the results of our study clearly demonstrate
that cyanoacrylate injection is more effective in the prevention
of rebleeding from GVs and improving patient survival as
compared with b-blocker treatment. Propranolol is not effective
in preventing gastric variceal rebleeding despite reduction in
HVPG.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
Competing interests None.
Patient consent Obtained.
Ethics approval This study was conducted with the approval of the MAMC Ethical
Committee, New Delhi.
30.
31.
Provenance and peer review Not commissioned; externally peer reviewed.
REFERENCES
1.
2.
3.
4.
5.
6.
7.
8.
9.
Sarin SK, Lahoti D, Saxena SP, et al. Prevalence, classification and natural history of
gastric varices: a long-term follow-up study in 568 portal hypertension patients.
Hepatology 1992;16:1343e9.
Trudeau W, Prindiville T. Endoscopic injection sclerosis in bleeding gastric varices.
Gastrointest Endosc 1986;32:264e8.
Thakeb F, Salem SA, Abdallah M, et al. Endoscopic diagnosis of gastric varices.
Endoscopy 1994;26:287e91.
Bretagne JF, Dudicourt JC, Morisot D, et al. Is endoscopic variceal sclerotherapy
effective for the treatment of gastric varices? [abstract]. Dig Dis Sci 1986;31:505S.
Sarin SK. Long-term follow-up of gastric variceal sclerotherapy: an eleven-year
experience. Gastrointest Endosc 1997;46:8e14.
Lebrec D. Pharmacological treatment of portal hypertension: hemodynamic effects
and prevention of bleeding. Pharmacol Ther 1994;61:65e107.
Hayes PC, Davis JM, Lewis JA, et al. Meta-analysis of value of propranolol in
prevention of variceal haemorrhage. Lancet 1990;336:153e6.
Lin HC, Soubrane O, Lebrec O. Prevention of portal hypertension and portosystemic
shunts by early chronic administration of clonidine in conscious portal vein-stenosed
rats. Hepatology 1991;14:325e30.
Sarin SK, Groszmann RJ, Mosca PG, et al. Propranolol ameliorates the development
of portal-systemic shunting in a chronic murine schistosomiasis model of portal
hypertension. J Clin Invest 1991;87:1032e6.
Gut 2010;59:729e735. doi:10.1136/gut.2009.192039
32.
33.
34.
35.
36.
37.
38.
39.
Ideo G, Bellati G, Fesce E, et al. Nadolol can prevent the first gastrointestinal
bleeding in cirrhotics: a prospective, randomized study. Hepatology 1988;8:6e9.
The Italian Multicenter Project for Propranolol in Prevention of Bleeding.
Propranolol prevents first gastrointestinal bleeding in non-ascitic cirrhotic patients.
Final report of a multicenter randomized trial. J Hepatol 1989;9:75e83.
Wu CY, Yeh HZ, Chen GH. Pharmacologic efficacy in gastric variceal
rebleeding and survival: including multivariate analysis. J Clin Gastroenterol
2002;35:127e32.
Lo GH, Lai KH, Cheng JS, et al. A prospective, randomized trial of butyl
cyanoacrylate injection versus band ligation in the management of bleeding gastric
varices. Hepatology 2001;33:1060e4.
Soehendra N, Nam VC, Grimm H, et al. Endoscopic obliteration of large
esophagogastric varices with bucrylate. Endoscopy 1986;18:25e6.
Ramond MJ, Valla D, Mosnier JF, et al. Successful endoscopic obliteration of
gastric varices with butyl cyanoacrylate. Hepatology 1989;10:488e93.
Huang YH, Yeh HZ, Chen GH, et al. Endoscopic treatment of bleeding gastric varices
by N-butyl-2-cyanoacrylate (Histoacryl) injection: long-term efficacy and safety.
Gastrointest Endosc 2000;52:160e7.
Binmoeller KF, Soehendra N. “Superglue”: the answer to variceal bleeding and
fundal varices? Endoscopy 1995;27:392e6.
Ryan BM, Stockbrugger RW, Ryan JM. A pathophysiologic, gastroenterologic, and
radiologic approach to the management of gastric varices. Gastroenterology
2004;126:1175e89.
Sarin SK, Jain AK, Jain M, et al. A randomized controlled trial of cyanoacrylate
versus alcohol injection in patients with isolated fundic varices. Am J Gastroenterol
2002;97:1010e15.
Akahoshi T, Hashizume M, Shimabukuro R, et al. Long-term results of endoscopic
histoacryl injection sclerotherapy for gastric variceal bleeding: a 10-year experience.
Surgery 2002;131:S176e81.
Kim JW, Baik SK, Kim KH, et al. Effect of endoscopic sclerotherapy using N-butyl2-cyanoacrylate in patients with gastric variceal bleeding. Korean J Hepatol
2006;12:394e403.
de Franchis R. Evolving consensus in portal hypertension report of the Baveno
IV consensus workshop on methodology of diagnosis and therapy in portal
hypertension. J Hepatol 2005;43:167e76.
Ramesh J, Limdi JK, Sharma V, et al. The use of thrombin injections in the
management of bleeding gastric varices: a single center experience. Gastrointest
Endosc 2008;68:877e82.
Yang WL, Tripathi D, Therapondos G, et al. Endoscopic use of human thrombin in
bleeding gastric varices. Am J Gastroenterol 2002;97:1381e5.
Datta D, Vlavianos P, Alisa A, et al. Use of fibrin glue (beriplast) in the management
of bleeding gastric varices. Endoscopy 2003;35:675e8.
Tripathi D, Hayes PC. Endoscopic therapy for bleeding gastric varices: to clot or
glue? Gastrointest Endosc 2008;68:883e6.
Wadhawan M, Dubey S, Sharma BC, et al. Hepatic venous pressure gradient in
cirrhosis: correlation with the size of varices, bleeding, ascites, and Child’s status.
Dig Dis Sci 2006;51:2264e9.
Evrard S, Dumonceau JM, Delhaye M, et al. Endoscopic histoacryl obliteration vs.
propranolol in the prevention of esophagogastric variceal rebleeding: a randomized
trial. Endoscopy 2003;35:729e35.
Tripathi D, Therapondos G, Jackson E, et al. The role of the transjugular intrahepatic
portosystemic stent shunt (TIPSS) in the management of bleeding gastric varices:
clinical and haemodynamic correlations. Gut 2002;51:270e4.
Watanabe K, Kimura K, Matsutani S, et al. Portal hemodynamics in patients with
gastric varices. A study in 230 patients with esophageal and/or gastric varices using
portal vein catheterization. Gastroenterology 1988;95:434e40.
Matsumoto A, Hamamoto N, Nomura T, et al. Balloon-occluded retrograde
transvenous obliteration of high-risk gastric fundal varices. Am J Gastroenterol
1999;94:643e9.
Hong CH, Kim HJ, Park JH. Treatment of patients with gastric variceal hemorrhage:
endoscopic N-butyl-2-cyanoacrylate injection versus balloon-occluded retrograde
transvenous obliteration. J Gastroenterol Hepatol 2009;24:372e8.
Lo GH, Liang HL, Chen WC, et al. A prospective, randomized controlled trial of
transjugular intrahepatic portosystemic shunt vs. cyanoacrylate injection in the
prevention of gastric variceal rebleeding. Endoscopy 2007;39:679e85.
Marques P, Maluf-Filho F, Kumar A, et al. Long-term outcomes of acute gastric
variceal bleeding in 48 patients following treatment with cyanoacrylate. Dig Dis Sci
2008;53:544e50.
Tripathi D, Ferguson JW, Therapondos G, et al. Review article: recent advances
in the management of bleeding gastric varices. Aliment Pharmacol Ther
2006;24:1e17.
Tan PC, Hou MC, Lin HC, et al. A randomized trial of endoscopic treatment of acute
gastric variceal hemorrhage: N-butyl-2-cyanoacrylate injection versus band ligation.
Hepatology 2006;43:690e7.
Hwang SS, Kim HH, Park SH, et al. N-butyl-2-cyanoacrylate pulmonary embolism
after endoscopic injection sclerotherapy for gastric variceal bleeding. J Comput Assist
Tomogr 2001;25:16e22.
Yoshida H, Onda M, Tajiri T, et al. Endoscopic findings of bleeding esophagogastric
varices. Hepatogastroenterology 2002;49:1287e9.
Bhasin DK, Sharma BC, Prasad H, et al. Endoscopic removal of sclerotherapy needle
from gastric varix after N-butyl-2-cyanoacrylate injection. Gastrointest Endosc
2000;51:497e8.
735
Stretching to prevent or reduce muscle soreness after
exercise (Review)
Herbert RD, de Noronha M
This is a reprint of a Cochrane review, prepared and maintained by The Cochrane Collaboration and published in The Cochrane Library
2008, Issue 4
http://www.thecochranelibrary.com
Stretching to prevent or reduce muscle soreness after exercise (Review)
Copyright © 2008 The Cochrane Collaboration. Published by John Wiley & Sons, Ltd.
TABLE OF CONTENTS
HEADER . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
PLAIN LANGUAGE SUMMARY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
BACKGROUND . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
OBJECTIVES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
METHODS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
RESULTS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
DISCUSSION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
AUTHORS’ CONCLUSIONS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
ACKNOWLEDGEMENTS
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
REFERENCES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
CHARACTERISTICS OF STUDIES . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
DATA AND ANALYSES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Analysis 1.1. Comparison 1 Pre-exercise stretching, Outcome 1 Soreness on day 0.5 (assessed 6-17 hours post-exercise).
Analysis 1.2. Comparison 1 Pre-exercise stretching, Outcome 2 Soreness on day 1 (assessed 18-29 hours post-exercise).
Analysis 2.1. Comparison 2 Post-exercise stretching, Outcome 1 Soreness on day 0.5 (assessed 6-17 hours post-exercise).
Analysis 2.2. Comparison 2 Post-exercise stretching, Outcome 2 Soreness on day 1 (assessed 18-29 hours post-exercise).
Analysis 3.1. Comparison 3 Either pre- or post-exercise stretching, Outcome 1 Soreness on day 0.5 (assessed 6-17 hours
post-exercise). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Analysis 3.2. Comparison 3 Either pre- or post-exercise stretching, Outcome 2 Soreness on day 1 (assessed 18-29 hours
post-exercise). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
post-exercise). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
post-exercise). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
post-exercise). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
post-exercise). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
APPENDICES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
WHAT’S NEW . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
HISTORY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
CONTRIBUTIONS OF AUTHORS . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
DECLARATIONS OF INTEREST . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
SOURCES OF SUPPORT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
INDEX TERMS
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1
1
2
2
3
3
5
7
8
8
9
11
19
20
21
21
22
22
23
23
24
24
25
25
26
26
27
27
28
29
29
29
30
31
31
31
31
31
i
[Intervention Review]
Stretching to prevent or reduce muscle soreness after
exercise
Robert D Herbert1 , Marcos de Noronha2
1 The
George Institute for International Health, Camperdown, Australia. 2 School of Physiotherapy, University of Sydney, Lidcombe,
Australia
Contact address: Robert D Herbert, The George Institute for International Health, PO Box M201, Missendown Road, Camperdown,
New South Wales, 2050, Australia. [email protected].
Editorial group: Cochrane Bone, Joint and Muscle Trauma Group.
Publication status and date: Edited (no change to conclusions), published in Issue 4, 2008.
Review content assessed as up-to-date: 15 August 2006.
Citation: Herbert RD, de Noronha M. Stretching to prevent or reduce muscle soreness after exercise. Cochrane Database of Systematic
Reviews 2007, Issue 4. Art. No.: CD004577. DOI: 10.1002/14651858.CD004577.pub2.
ABSTRACT
Background
Many people stretch before or after (or both) engaging in athletic activity. Usually the purpose is to reduce risk of injury, reduce soreness
after exercise, or enhance athletic performance.
Objectives
The aim of this review was to determine effects of stretching before or after exercise on the development of post-exercise muscle soreness.
Search strategy
We searched the Cochrane Bone, Joint and Muscle Trauma Group Specialised Register (to April 2006), the Cochrane Central Register
of Controlled Trials (The Cochrane Library 2006, Issue 2), MEDLINE (1966 to May 2006), EMBASE (1988 to May 2006), CINAHL
(1982 to May 2006), SPORTDiscus (1949 to May 2006), PEDro (to May 2006) and reference lists of articles.
Selection criteria
Eligible studies were randomised or quasi-randomised studies of any pre-or post-exercise stretching technique designed to prevent or
treat delayed-onset muscle soreness (DOMS), provided the stretching was conducted soon before or soon after exercise. To be eligible
studies must have assessed muscle soreness or tenderness.
Data collection and analysis
Methodological quality of the studies was assessed using the Cochrane Bone, Joint and Muscle Trauma Group’s methodological quality
assessment tool. Estimates of effects of stretching were converted to a common 100-point scale. Outcomes were pooled in a fixed-effect
meta-analysis.
Main results
Of the 10 included studies, nine were carried out in laboratory settings using standardised exercise protocols and one involved postexercise stretching in footballers. All participants were young healthy adults. Three studies examined the effects of stretching before
exercise and seven studies investigated the effects of stretching after exercise. Two studies, both of stretching after exercise, involved
1
repeated stretching sessions at intervals of greater than two hours. The duration of stretching applied in a single session ranged from
40 to 600 seconds.
All studies were small (between 10 and 30 participants received the stretch condition) and of questionable quality.
The effects of stretching reported in individual studies were very small and there was a high degree of consistency of results across
studies. The pooled estimate showed that pre-exercise stretching reduced soreness one day after exercise by, on average, 0.5 points on a
100-point scale (95% CI -11.3 to 10.3; 3 studies). Post-exercise stretching reduced soreness one day after exercise by, on average, 1.0
points on a 100-point scale (95% CI -6.9 to 4.8; 4 studies). Similar effects were evident between half a day and three days after exercise.
Authors’ conclusions
The evidence derived from mainly laboratory-based studies of stretching indicate that muscle stretching does not reduce delayed-onset
muscle soreness in young healthy adults.
PLAIN LANGUAGE SUMMARY
Stretching to prevent or reduce muscle soreness after exercise
Many people stretch prior to or after engaging in physical activities such as sport. Usually the purpose is to reduce the risk of injury,
reduce soreness after exercise, or enhance athletic performance.
The review located 10 relevant randomised trials looking at the effect of stretching before or after physical activity on muscle soreness.
The trials were mostly small and of questionable quality. Nine were conducted in laboratories using standardised exercises. Only one
study examined the effect of stretching on muscle soreness after sport. Three of the studies examined the effects of stretching before
physical activity and seven examined effects of stretching after physical activity.
The 10 studies produced very consistent findings. They showed there was minimal or no effect on the muscle soreness experienced
between half a day and three days after the physical activity. Effects of stretching on effect on other outcomes such as injury and
performance were not examined in this review.
BACKGROUND
Many people stretch before or after (or before and after) engaging
in athletic activity. Usually the purpose is to reduce the risk of
injury, reduce soreness after exercise, or enhance athletic performance (Cross 1999; de Vries 1961; Gleim 1990; Gleim 1997).
This review focuses on the effects of muscle stretching on muscle
soreness. Another Cochrane review (Yeung 2003) has examined
whether stretching prevents injury, and several non-Cochrane systematic reviews have examined whether stretching prevents muscle soreness or injury, or enhances performance (Herbert 2002;
Shrier 2004; Thacker 2004; Weldon 2003).
The muscle soreness that is the focus of this review is sometimes
called delayed-onset muscle soreness to differentiate it from soreness that occurs as muscle fatigue, or immediately after muscles
have been fatigued. Delayed-onset muscle soreness (hereafter re-
ferred to simply as muscle soreness) is usually caused by unaccustomed exercise, particularly exercise that requires primarily eccentric muscle contraction. Eccentric contractions occur when the
muscle lengthens as it contracts. Typically the soreness arises within
the first day after exercise and peaks in intensity at around 48
hours (Balnave 1993; Bobbert 1986; Wessel 1994). The soreness
is usually associated with tenderness and may be associated with
swelling (Bobbert 1986; Wessel 1994).
The series of events that ultimately cause muscle soreness are only
partly understood. The initial event is probably mechanical disruption of sarcomeres (contractile units within muscle fibres; Proske
2001). This causes swelling of damaged muscle fibres and initiates
an inflammatory response, which could excite nociceptors (receptors capable of transmitting information about pain; Lieber 2002).
2
Muscle soreness is usually only experienced when the muscle contracts or is stretched. This indicates that, while swelling of muscle
fibres and inflammation may be necessary for muscle soreness to
occur, swelling and inflammation are not sufficient to cause soreness on their own. Muscle soreness is exacerbated by vibration over
the belly of the muscle, and pressure thresholds increase when large
diameter afferents are blocked (large diameter afferents are nerve
cells that conduct information about touch and limb position),
indicating that excitation of large diameter muscle afferents, probably primary spindle afferents from stretch receptors in muscles,
is involved in generating the sensation of soreness (Weerakkody
2001).
The practice of stretching to prevent muscle soreness was encouraged by early investigators of muscle soreness who thought that
unaccustomed exercise caused muscle spasm (de Vries 1961). Muscle spasm was believed to impede blood flow to the muscle, causing ischaemic pain and further spasm. Stretching the muscle was
thought to restore blood flow to the muscle and interrupt the painspasm-pain cycle. The muscle spasm theory of muscle soreness has
since been discredited (Bobbert 1986), but the practice of stretching persists.
Typically people who stretch to prevent muscle soreness do so prior
to exercise, but some people stretch after exercise. Usually each atrisk muscle is stretched for between 15 seconds and 2 minutes,
once or several times. Some proponents of stretching recommend
applying a sustained stretch to the relaxed muscle (de Vries 1961;
this is called static stretching), but others recommend more elaborate techniques such as the ’contract-relax-agonist contract’ technique (Feland 2001). This technique and related techniques (collectively called proprioceptive neuromuscular facilitation (PNF)
techniques) involve contracting the muscle strongly before applying a stretch.
clinical trials of pre-or post-exercise stretching techniques designed
to prevent or treat delayed-onset muscle soreness.
Types of participants
Trials involving participants of any age group, and of either sex.
Types of interventions
Any pre-or post-exercise stretching technique designed to prevent
or treat delayed-onset muscle soreness, provided the stretching was
conducted soon before or soon after exercise of any type.
Types of outcome measures
Measures of muscle soreness (pain) or tenderness (pain on palpation of the muscle).
Search methods for identification of studies
We searched the Cochrane Bone, Joint and Muscle Trauma Group
Specialised Register (12 April 2006), the Cochrane Central Register of Controlled Trials (The Cochrane Library 2006, Issue 2)
(see Appendix 1), MEDLINE (OVID WEB; 1966 to 16 May
2006), EMBASE (OVID WEB; 1988 to 16 May 2006), CINAHL
(OVID WEB; 1982 to 16 May 2006), SPORTDiscus (OVID
WEB; 1949 to 16 May 2006), PEDro - Physiotherapy Evidence
Database (www.pedro.fhs.usyd.edu.au/accessed 16 May 2006)
and reference lists of articles.
In MEDLINE (OVID WEB) the subject-specific search in
Appendix 2 was used. As it was anticipated that only a small
number of references would be identified, the results were not
combined with a trial search strategy. This strategy was modified
slightly for use in the other OVID databases.
In PEDro, two searches were conducted (see Appendix 3).
OBJECTIVES
This review aims to determine the effect of stretching before or
after exercise on the development of post-exercise muscle soreness.
METHODS
Criteria for considering studies for this review
Types of studies
We considered any randomised or quasi-randomised (method of
allocating participants to a treatment that is not strictly random
e.g. by date of birth, hospital record number and alternation)
Data collection and analysis
Selection of trials
The two authors screened search results for potentially eligible
studies. When titles and abstracts suggested a study was potentially
eligible for inclusion a paper copy of the report was obtained. Disagreements between the two people regarding a study’s eligibility
were resolved by discussion or, where necessary, by a third person.
Assessment of methodological quality
The methodological quality of the studies was assessed using the
Cochrane Bone, Joint and Muscle Trauma Group quality assessment tool (Table 1). Two people independently assessed study
quality in this way. Disagreements were resolved by discussion.
Where necessary, a third person was consulted. It was anticipated
that several of the included studies would be laboratory-based studies which were unlikely to describe elements thought important
3
in clinical trials. These studies may appear methodologically weak
when assessed with a tool designed for clinical trials. Nonetheless
it was thought useful to document the degree to which they conform with best practice clinical trial design.
Table 1. Bone, Joint and Muscle Trauma Group methodological quality assessment tool
Item
Criterion
A
Was the assigned treatment adequately concealed prior to allocation?
2 = clearly yes (method did not allow disclosure of assignment).
1 = not sure (small but possible chance of disclosure of assignment or unclear).
0 = clearly no (quasi-randomised or open list/tables).
B
Were the outcomes of patients/participants who withdrew described and included in the analysis (intention to treat)?
2 = withdrawals well described and accounted for in analysis.
1 = withdrawals described and analysis not possible.
0 = no mention, inadequate mention, or obvious differences and no adjustment.
C
Were the outcome assessors blinded to treatment status?
2 = effective action taken to blind assessors.
1 = small or moderate chance of unblinding of assessors.
0 = not mentioned or not possible.
D
Were the treatment and control group comparable at entry?
2 = good comparability of groups, or confounding adjusted for in analysis.
1 = confounding small; mentioned but not adjusted for.
0 = large potential for confounding, or not discussed.
E
Were the participants blind to assignment status after allocation?
2 = effective action taken to blind participants.
1 = small or moderate chance of unblinding of participants.
0 = not possible, or not mentioned (unless double-blind), or possible but not done.
F
Were the treatment providers blind to assignment status?
2 = effective action taken to blind treatment providers.
1 = small or moderate chance of unblinding of treatment providers.
0 = not possible, or not mentioned (unless double-blind), or possible but not done.
G
Were care programmes, other than the trial options, identical?
2 = care programmes clearly identical.
1 = clear but trivial differences.
0 = not mentioned or clear and important differences in care programmes.
H
Were the inclusion and exclusion criteria clearly defined?
2 = clearly defined.
1 = inadequately defined.
0 = not defined.
4
Table 1. Bone, Joint and Muscle Trauma Group methodological quality assessment tool
(Continued)
I
Were the interventions clearly defined? (This item was optional)
2 = clearly defined interventions are applied with a standardised protocol.
1 = clearly defined interventions are applied but the application protocol is not standardised.
0 = intervention and/or application protocol are poorly or not defined.
J
Were the outcome measures used clearly defined? (by outcome)
2 = clearly defined.
1 = inadequately defined.
0 = not defined.
K
Were diagnostic tests used in outcome assessment clinically useful? (by outcome)
2 = optimal.
1 = adequate.
0 = not defined, not adequate.
L
Was the surveillance active, and of clinically appropriate duration?
2 = active surveillance and appropriate duration.
1 = active surveillance, but inadequate duration.
0 = surveillance not active or not defined.
Data extraction
Soreness or tenderness data from stretch and comparison groups
were extracted from study reports using a pro forma. This was
done by the authors; again, disagreements were resolved by discussion or a third person. It was expected that some studies would
report soreness data and some would report tenderness data. Some
studies reported raw scores, whereas others report change scores
or covariance-adjusted scores. It was also expected that outcomes
would usually be measured at a series of times following exercise.
Where available, data were extracted for each of a series of 12-hour
intervals following exercise (6-17 hours, 18-29 hours, 30-41 hours,
etc). Only one outcome was recorded from each study for each
time. Where multiple outcomes were available, soreness data was
preferred to tenderness data. Covariate-adjusted data were most
preferred, then change scores, then raw scores. Where within-subject studies (studies in which participants’ limbs were allocated to
stretch and control conditions) or cross-over studies (studies in
which each participant was sequentially exposed to both stretch
and control conditions) did not provide estimates of the variance
of differences between conditions or of the correlations between
outcomes under the two conditions being compared (Elbourne
2002) the variance of the difference between conditions was estimated by assuming the data were independent.
Analysis
Pain and tenderness data were typically reported on 100 mm visual
analogue scales or 10 point scales. To facilitate pooling, data were
converted to a common 100-point scale. The effect of stretching was estimated from each study using the difference between
mean pain levels in stretch and comparison groups. Where there
was apparent qualitative homogeneity in participants, interventions and outcome measures, meta-analysis was considered. Metaanalysis was conducted using the RevMan Analysis program in
Review Manager (RevMan 2003). The effects of stretching was
described as a weighted mean difference. Data were inspected for
statistical heterogeneity. As there was no evidence of heterogeneity
of effect (Cochran Q, P < 0.1; Fleiss 1993), a fixed-effect model
was used to pool findings across studies. Effects of pre- and postexercise stretching were initially estimated separately but, as there
was no evidence of heterogeneity, they were combined in a secondary analysis. A post-hoc analysis was conducted to obtain separate estimates from laboratory-based studies and the one community-based study. An additional post-hoc analysis was conducted
to obtain separate estimates from studies of static stretch and the
one study that used another method of stretching.
RESULTS
5
Description of studies
See: Characteristics of included studies; Characteristics of excluded
studies.
The search retrieved 25 potentially eligible studies of which 10
were included (Buroker 1989; Dawson 2005; Gulick 1996; High
1989; Johansson 1999; Maxwell 1988; McGlynn 1979; Terry
1985; Wessel 1994a; Wessel 1994b). The titles or abstracts of a
further four papers appeared potentially eligible but could not be
retrieved (Costa 1997; Buckenmeyer 1998; Lin 1999; Lu 1992)
and still await assessment.
Included studies
Participants of all included studies were young healthy adults (age
less than 40 years or mean age less than 26 years); no studies investigated effects of stretching on muscle soreness in children or
middle-aged or older adults. All but one of the studies were carried out in laboratory settings and involved standardised exercise.
The exception (Dawson 2005) investigated the effect of post-game
stretching on muscle soreness in Australian rules football players.
Three studies examined the effect of stretching before exercise (
High 1989; Johansson 1999; Wessel 1994a); the remaining seven
studies investigated the effects of stretching after exercise. The
stretch was of the knee extensor muscles (Buroker 1989; High
1989; Maxwell 1988; Terry 1985), hamstring muscles (Johansson
1999; Wessel 1994a; Wessel 1994b), ankle plantarflexor muscles (
Buroker 1989), wrist extensor muscles (Gulick 1996), elbow flexor
muscles (McGlynn 1979), or “legs and back” muscles (Dawson
2005). All but one of the studies examined the effects of static
stretching; one study examined the effects of a PNF technique
(“hold-relax”; Terry 1985). Two studies, both of stretching after
exercise, involved repeated stretching sessions (Buroker 1989: 12
sessions; McGlynn 1979: 5 sessions). The duration of stretching
applied in a single session ranged from 40 seconds (Terry 1985)
to 600 seconds (Gulick 1996), with a median of 390 seconds.
In six studies (Buroker 1989; Dawson 2005; Gulick 1996; High
1989; Maxwell 1988; McGlynn 1979) participants were randomly
allocated to stretch and control conditions. One of these (Dawson
2005) was a cross-over study in which each subject experienced
both the stretch and control conditions. The remaining four studies (Johansson 1999; Terry 1985; Wessel 1994a; Wessel 1994b)
stratified by participant and randomised legs to stretch and control conditions. Thus five studies (the single cross-over study and
the four studies that compared right and left legs) provided paired
comparisons. In all but two of these sufficient data were supplied
to take account of the paired nature of the data. In the studies
by Wessell and Wan (Wessel 1994a; Wessel 1994b) the estimates
from an analysis which accounted for the paired nature of the data
were less precise (Wessel 1994a) or had similar precision (Wessel
1994b) to estimates obtained by ignoring the paired nature of the
data, so the data were treated as if they were independent. Data
from the other two studies (Dawson 2005; Johansson 1999) were
treated as independent. See ’Characteristics of included studies’
for further details.
One study (Gulick 1996) reported only pooled data from all
groups and another (Maxwell 1988) did not provide any data
that could be used to calculate precison. (Both studies concluded
stretching had no effect.) Consequently these studies did not contribute to the meta-analysis.
Excluded studies
Eleven studies were excluded (see ’Characteristics of excluded studies’).
Risk of bias in included studies
Table 2 summarises the methodological quality assessments. The
reviewers were not always able to agree on how particular items
were rated; these items are marked as “No consensus” in Table
2. All included studies were apparently randomised. None of the
studies described if the allocation schedule was concealed. There
was no blinding in any studies: this is because it is difficult or
impossible in studies of stretching to blind the person supervising
the stretching exercises and the person doing the stretches and, as
the outcome (soreness) must be self-reported, the assessor cannot
be blinded. None of the studies clearly indicated that analysis was
by intention to treat and in one study (Dawson 2005) analysis
was explicitly not by intention to treat. Only one study explicitly
reported completeness of follow up (Dawson 2005 reported data
were available for 31 of 34 player-conditions), although two further studies (High 1989; Terry 1985) provided data that implied
follow up was complete. It is likely that follow up was complete
or nearly complete in most or all studies because the follow-up
period was very short (three days or less) and outcome measures
were not onerous.
Table 2. Quality of trials. Rows correspond to those in Table 01
Buroker
1989
Dawson
2005
Gulick
1996
High
1989
Johansson Maxwell
1999
1988
McGlynn
1979
Terry
1985
Wessel
1994a
Wessel
1994b
1
0
1
1
1
1
1
1
1
1
6
Table 2. Quality of trials. Rows correspond to those in Table 01
(Continued)
0
0
0
0
No
consensus
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
0
0
0
0
0
0
2
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
2
2
2
2
2
2
2
2
2
2
0
No
consensus
1
2
0
2
2
0
0
2
1
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
Effects of interventions
One study (Gulick 1996) reported only pooled data from all
groups and another (Maxwell 1988) did not provide any data that
could be used to calculate precision. Consequently these studies
did not contribute to the meta-analysis. (Both studies concluded
stretching had no effect.) The following results are based on data
from the remaining studies.
The mean effects of stretching reported in individual studies were
too small to be of clinical relevance. There was a remarkable degree of consistency (homogeneity) of results across studies (I2 for
all pooled estimates was 0%). The pooled estimate was that preexercise stretching reduced soreness one day after exercise by, on
average, 0.5 points on a 100-point scale (95% CI -11.3 to 10.3; 3
studies; Analysis 01.02). Post-exercise stretching reduced soreness
one day after exercise by, on average, 1.0 points on a 100-point
scale (95% CI -6.9 to 4.8; 4 studies; Analysis 02.02). Similar effects were evident between half a day and three days after exercise.
As there was no sign of statistical heterogeneity we pooled the preexercise and post-exercise studies. The pooled estimates are that
pre- and post-exercise stretching reduces soreness, on average, by
0.9 points on a 100-point scale at one day (95% CI -6.1 to 4.2; 7
studies; Analysis 03.02), increases soreness by 1.0 points on a 100point scale at two days (95% CI -4.1 to 6.2; 7 studies; Analysis
03.04) and decreases soreness by 0.3 points on a 100-point scale
at three days (95% CI -6.8 to 6.2; 5 studies; Analysis 03.06).
Similar estimates of effects were obtained in the only study that
tested PNF stretching. Terry 1985 showed the effect of PNF
stretching was to reduce soreness by 1.1 points on a 100-point
scale at both one and two days after exercise (95% CIs of -13.0
to 10.8; and -13.3 to 11.1 respectively). Also similar effects were
obtained in the one study of stretching after sport. Dawson 2005
showed that stretching increased muscle soreness by 3.0 points on
a 100-point scale (95% CI -7.1 to 13.1) at 15 hours and by 4.0
points (95% CI -5.2 to 13.2) at 48 hours after an Australian rule
football match.
DISCUSSION
The evidence from randomised studies indicates that stretching
soon before or soon after exercise does not lessen muscle soreness
on subsequent days.
7
This conclusion is based on studies that rate poorly on conventional scales of study quality. None of the reviewed studies explicitly concealed allocation, none were blinded, and none explicitly
analysed by intention to treat. Nonetheless there are some reasons
to believe that the findings may not be seriously biased: First, there
is a high degree of consistency across studies in estimates of the
effect of stretching. Also, biases due to non-concealment, lack of
blinding and failure to analyse by intention to treat typically produce exaggerated effects (Chalmers 1983; Colditz 1989; Moher
1998; Schulz 1995), yet this review found consistent evidence of
near-zero effects. The high degree of consistency of estimates across
studies suggests that alternative weightings (based, for example,
on different approaches to estimating precision from studies that
used within-subject designs) would produce very similar pooled
estimates of effects.
All but one of the eligible studies were laboratory-based studies. These studies assessed whether supervised stretching reduced
the soreness produced by standardised exercise involving eccentric contractions of a particular muscle group. Such studies can be
considered to have an “explanatory” orientation because they tell
us about what the effects of stretching could be when administered under ideal conditions. It might have been expected that the
effects of stretching would be smaller in community settings. In
community settings, unlike the laboratory settings of these studies, stretching is not usually supervised, so it may be carried out
sub-optimally. Also, the everyday experience of muscle soreness
may be of soreness in many muscle groups, and it may be difficult to effectively stretch all of the muscle groups that might become sore. These considerations suggest the effects of stretching
might be smaller in community settings than in laboratory settings. However the estimates from nine laboratory-based studies
and one community-based study suggest there are similar (nearzero) effects in both settings.
The practice of stretching before or after exercise may have been
arisen because of the mistaken belief, prevalent in the 1960s and
1970s, that muscle soreness was due to muscle spasm (de Vries
1961). This view has since been discredited (Bobbert 1986). Contemporary beliefs about the mechanisms of muscle soreness do
not suggest any means by which stretching before or after exercise
could reduce the soreness that follows that exercise session. However if, as has been proposed, muscle soreness is due to excessive
elongation of some sarcomeres within muscle fibres (Proske 2001),
it is conceivable that any intervention which increased the number of sarcomeres in series in muscle fibres, or which increased the
length or compliance of tendons, could reduce sarcomere strains
and lessen muscle damage associated with unaccustomed eccentric
muscle contractions. This suggests the possibility that stretching
may be more effective for people who have very short muscles, or
if the stretching is repeated for weeks or months or years. To date,
however, randomised trials have only examined the short-term effects of one or a few sessions of stretching, and no randomised
trials have specifically investigated interactions between effects of
stretching and flexibility.
The best available evidence indicates stretching does not reduce
muscle soreness. However there are other justifications for stretching. Some evidence suggests that once muscle soreness has developed stretching may provide a transient relief of soreness (Reisman
2005): some people stretch to reduce risk of injury, others stretch
to enhance athletic or sporting performance, and yet others stretch
because it gives them a sense of well-being. The current review
does not provide any evidence of an effect or otherwise of stretching on risk of injury, performance or well-being.
AUTHORS’ CONCLUSIONS
Implications for practice
The available evidence from randomised trials carried out mainly
in laboratory settings consistently suggests that stretching before
or after exercise does not prevent muscle soreness in young healthy
adults.
Implications for research
Arguably the findings of this review are clear enough that further
research into the effects of stretching on muscle soreness is not
necessary. We see some merit in this view. However, in our opinion
it would be useful to conduct further trials of the effects of longerterm stretching programs, in community-based populations and
for people with reduced flexibility.
ACKNOWLEDGEMENTS
This review is based on an earlier “non-Cochrane” review conducted by Rob Herbert and Michael Gabriel and published in
2002 (Herbert RD, Gabriel M. BMJ 2002;325(7362):468-72).
Through that earlier work Michael Gabriel indirectly contributed
to the current review. His contribution is gratefully acknowledged.
The authors thank Helen Handoll, Vicki Livingstone, Janet Wale,
Gisela Sole and Lindsey Shaw for helpful comments on the review,
and Lisa Harvey for assistance with data extraction.
8
REFERENCES
References to studies included in this review
Buroker 1989 {published data only}
Buroker KC, Schwane JA. Does postexercise static stretching
alleviate delayed muscle soreness?. Physician and Sportsmedicine
1989;17(6):65–83.
Dawson 2005 {published data only}
Dawson B, Gow S, Modra S, Bishop D, Stewart G. Effects of
immediate post-game recovery procedures on muscle soreness,
power and flexibility levels over the next 48 hours. Journal of
Science and Medicine in Sport 2005;8(2):210–21.
Gulick 1996 {published data only}
Gulick DT, Kimura IF, Sitler M, Paolone A, Kelly JD. Various
treatment techniques on signs and symptoms of delayed onset
muscle soreness. Journal of Athletic Training 1996;31(2):145–52.
High 1989 {published data only}
High DM, Howley ET, Franks BD. The effects of static stretching
and warm-up on prevention of delayed-onset muscle soreness.
Research Quarterly for Exercise and Sport 1989;60(4):357–61.
Johansson 1999 {published data only}
Johansson PH, Lindström L, Sundelin G, Lindström B. The effects
of preexercise stretching on muscular soreness, tenderness and force
loss following heavy eccentric exercise. Scandinavian Journal of
Medicine and Science in Sports 1999;9(4):219–25.
Maxwell 1988 {published data only}
Maxwell S, Kohn S, Watson A, Balnave RJ. Is stretching effective in
the prevention of or amelioration of delayed-onset muscle soreness?.
In: Torode M editor(s). The athlete maximising participation and
minimising risk. Sydney: Cumberland College of Health Sciences,
1988:109–18.
McGlynn 1979 {published data only}
McGlynn GH, Laughlin NT, Rowe V. Effect of electromyographic
feedback and static stretching on artificially induced muscle
soreness. American Journal of Physical Medicine 1979;58(3):139–48.
Terry 1985 {published data only}
∗
Terry L. Stretching and muscle soreness. An investigation into the
effects of hold-relax stretch on delayed post-exercise soreness in the
quadriceps muscle - a pilot study [thesis]. Adelaide (Australia): South
Australian Institute of Technology, 1985.
Terry L. Stretching and muscle soreness: an investigation into the
effects of a hold-relax stretch on delayed post-exercise soreness in
the quadriceps muscle: a pilot study [abstract]. Australian Journal of
Physiotherapy Australian Journal of Physiotherapy 1987;33(1):69.
Wessel 1994a {published data only}
Wessel J, Wan A. Effect of stretching on the intensity of delayedonset muscle soreness. Clinical Journal of Sport Medicine 1994;4(2):
83–7.
Wessel 1994b {published data only}
83–7.
References to studies excluded from this review
Abraham 1977 {published data only}
Abraham WM. Factors in delayed muscle soreness. Medicine and
Science in Sports 1997;9(1):11–20.
Bale 1991 {published data only}
Bale P, James H. Massage, warmdown and rest as recuperative
measures after short term intense exercise. Physiotherapy in Sport
1991;13(2):4–7.
de Vries 1966 {published data only}
de Vries HA. Quantitative electromyographic investigation of the
spasm theory of muscle pain. American Journal of Physical Medicine
1966;45(3):119–34.
Jayaraman 2004 {published data only}
Jayaraman RC, Reid RW, Foley JM, Prior BM, Dudley GA,
Weingand KW, et al.MRI evaluation of topical heat and static
stretching as therapeutic modalities for the treatment of eccentric
exercise-induced muscle damage. European Journal of Applied
Physiology 2004;93(1-2):30–8.
Lightfoot 1997 {published data only}
Lightfoot JT, Char D, McDermott J, Goya C. Immediate
postexercise massage does not attenuate delayed onset muscle
soreness. Journal of Strength and Conditioning Research 1997;11:
119–24.
Lund 1998 {published data only}
Lund H, Vestergaard-Poulsen, Kanstrup I-L, Sejrsen P. The effect of
passive stretching on delayed onset muscle soreness, and other
detrimental effects following exercise. Scandinavian Journal of
Medicine and Science in Sports 1998;8:216–21.
Prentice 1982 {published data only}
Prentice WE. Electromyographic analysis of the effectiveness of
heat or cold and stretching for inducing relaxation in injured
muscle. Journal of Orthopaedic and Sports Physical Therapy 1982;3
(3):133–40.
Rahnama 2005 {published data only}
Rahnama N. Rahmani-Nia F. Ebrahim K. The isolated and
combined effects of selected physical activity and ibuprofen on
delayed-onset muscle soreness. Journal of Sports Sciences 2005;23
(8):843–850.
Reilly 2002 {published data only}
Reilly T, Rigby M. Effect of an active warm-down following
competitive soccer. In: Spinks W, Reilly T, Murphy A editor(s).
Science and Football IV. Proceedings of the 4th World Congress of
Science and Football; 1999 Feb 22-26; Sydney, NSW. London:
Routledge, 2002:226–9.
Rodenburg 1994 {published data only}
Rodenburg JB, Steenbeek, Schiereck P, Bar PR. Warm-up,
stretching and massage diminish harmful effects of eccentric
exercise. International Journal of Sports Medicine 1994;15(7):414–9.
Smith 1993 {published data only}
Smith LL, Brunetz MH, Chenier TC, McCammon MR, Houmard
JA, Franklin ME, et al.The effects of static and ballistic stretching
on delayed onset muscle soreness. Research Quarterly for Exercise
and Sport 1993;1:103–7.
9
References to studies awaiting assessment
Buckenmeyer 1998 {published data only}
Buckenmeyer P, Kokkinidis E. Machairidou M, Tsamourtas A. The
effect of static stretching and cryotherapy on the recovery of
delayed muscle soreness [Greek]. Exercise & Society Journal of Sport
Science (Komotini) 1998;(19):45–53.
Costa 1997 {published data only}
Costa MC, Cabral JG, Muniz GMA, Silva EF, Silva MG, Duarte
JAR, et al.Effects of stretching after exercise on some indirect
markers muscle damage. 9th European Congress on Sports
Medicine; 1997 Sept 23-26; Porto, Portugal. 1997:1.
Elbourne 2002
Elbourne DR, Altman DG, Higgins JPT, Curtin F, Worthington
HV, Vail A. Meta-analyses involving cross-over trials:
methodological issues. International Journal of Epidemiology 2002;
31(1):140–9.
Feland 2001
Feland JB, Myrer JW, Merrill RM. Acute changes in hamstring
flexibility: PNF versus static stretch in senior athletes. Physical
Therapy in Sport 2001;2(4):186–93.
Fleiss 1993
Fleiss JL. The statistical basis of meta-analysis. Statistical Methods in
Medical Research 1993;2:121–45.
Lin 1999 {published data only}
Lin WH. The effects of massage, stretch and meloxicam on delayed
onset muscle soreness [Masters thesis]. Taiwan: National College of
Physical Education and Sports Taoyuan, 1999.
Gleim 1990
Gleim GW, Stachenfeld NS, Nicholas JA. The influence of
flexibility on the economy of walking and jogging. Journal of
Orthopedic Research 1990;8:814–23.
Lu 1992 {published data only}
∗
Lu D, Fan JY, Qu Z. An immuno-electron microscopic study of
the effect of acupuncture and static stretch on contractile structural
alteration of skeletal muscle after strenuous exercise. Sport Science
(Beijing) 1992;12(6):47–51.
Lu D, Fan JY, Qu Z. An immuno-electron microscopic study of the
effect of acupuncture and static stretch on ultrastructural alteration
of M-line of skeletal muscle after strenuous exercise. Sports Science
(Beijing) 1992;12(6):52–9.
Lu D, Fan JY, Qu Z. An immuno-electron microscopic study of the
effect of acupuncture and static stretch on ultrastructural alteration
of Z-band in human skeletal muscle after strenuous exercise. Sports
Science (Beijing) 1992;12(6):60–5.
Gleim 1997
Gleim GW, McHugh MP. Flexibility and its effects on sports injury
and performance. Sports Medicine 1997;24:289–99.
Additional references
Balnave 1993
Balnave CD, Thomson MW. Effect of training on eccentric
exercise-induced muscle damage. Journal of Applied Physiology
1993;75(4):1545–51.
Bobbert 1986
Bobbert MF, Hollander AP, Huijing PA. Factors in delayed onset
muscle sorenesss of man. Medicine and Science in Sports and Exercise
1986;18(1):75–81.
Chalmers 1983
Chalmers TC, Celano P, Sacks HS, Smith H. Bias in treatment
assignment in controlled clinical trials. New England Journal of
Medicine 1983;309(22):1358–61.
Colditz 1989
Colditz GA, Miller JN, Mosteller F. How study design affects
outcomes in comparisons of therapy. I: Medical. Statistics in
Medicine 1989;8(4):441–54.
Cross 1999
Cross KM, Worrell TW. Effects of a static stretching program on
the incidence of lower extremity musculotendinous strains. Journal
of Athletic Training 1999;34:11–4.
de Vries 1961
de Vries HA. Prevention of muscular distress after exercise. Research
Quarterly 1961;32:177–85.
Lieber 2002
Lieber RL, Friden J. Morphologic and mechanical basis of delayedonset muscle soreness. Journal of the American Academy of
Orthopaedic Surgeons 2002;10(1):67–73.
Moher 1998
Moher D, Pham B, Jones A, Cook DJ, Jadad AR, Moher M, et
al.Does quality of reports of randomised trials affect estimates of
intervention efficacy reported in meta-analyses?. Lancet 1995;352
(9128):609–13.
Proske 2001
Proske U, Morgan DL. Muscle damage from eccentric exercise:
mechanism, mechanical signs, adaptation and clinical applications.
Journal of Physiology 2001;537(2):333–45.
Reisman 2005
Reisman S, Walsh LD, Proske U. Warm-up stretches reduce
sensations of stiffness and soreness after eccentric exercise. Medicine
and Science in Sports and Exercise 2005;37(6):929–36.
RevMan 2003
The Nordic Cochrane Centre, The Cochrane Collaboration.
Review Manager (RevMan). 4.2 for Windows. Copenhagen: The
Nordic Cochrane Centre, The Cochrane Collaboration, 2003.
Schulz 1995
Schulz KF, Chalmers I, Hayes RJ, Altman DG. Empirical evidence
of bias. Dimensions of methodological quality associated with
estimates of treatment effects in controlled trials. JAMA 1995;273
(5):408–12.
Shrier 2004
Shrier I. Does stretching improve performance?A systematic and
critical review of the literature. Clinical Journal of Sport Medicine
2004;14(5):267–73.
Thacker 2004
Thacker SB, Gilchrist J, Stroup DF, Kimsey CD. The impact of
stretching on sports injury risk: a systematic review of the literature.
Medicine and Science in Sports and Exercise 2004;36(3):371–8.
10
Weerakkody 2001
Weerakkody NS, Whitehead NP, Canny BJ, Gregory JE, Proske U.
Large-fibre mechanoreceptors contribute to muscle soreness after
eccentric exercise. Journal of Pain 2001;2(4):209–19.
Weldon 2003
Weldon SM, Hill RH. The efficacy of stretching for prevention of
exercise-related injury: a systematic review of the literature. Manual
Therapy 2003;8(3):141–50.
Wessel 1994
82–7.
Yeung 2003
Yeung EW, Yeung SS. Interventions for preventing lower limb softtissue injuries in runners. Cochrane Database of Systematic Reviews
2003, Issue 4. [DOI: 10.1002/14651858.CD001256]
References to other published versions of this review
Herbert 2002
Herbert RD, Gabriel M. Effects of pre- and post-exercise stretching
on muscle soreness, risk of injury and athletic performance: a
systematic review. BMJ 2002;325(7362):468–72.
∗
Indicates the major publication for the study
11
CHARACTERISTICS OF STUDIES
Characteristics of included studies [ordered by study ID]
Buroker 1989
Methods
Randomised (between subjects) design
Participants
23 participants (16 men and 7 women) aged 18-33 years with no known contraindications to exercise.
None was highly physically trained and none were extremely active in the six weeks preceding the study.
Interventions
Control group (N = 8): no stretching.
One stretch group 1 (N = 7): stretched left knee extensors after exercise.
A second stretch group (N = 8): stretched left knee extensors and right plantarflexors after exercise.
Stretches (10 30-second static stretches, each separated by 10 seconds rest, to noticeable tension or slight
discomfort)were performed at 2-hour intervals for the first 24 hours and 4-hour intervals for the following
48 hours except that participants were allowed to sleep for 8 hours each night.
Outcomes
The following measures were taken on days 1, 2 and 3:
1. Soreness on a 0-6 scale (0 = no pain, light pain only on palpation, 4 = moderate pain, stiffness and/or
weakness, especially on movement, 6 = severe pain limiting range of movement)
2. Force required to invoke tenderness (sum of measures at 9 sites)
3. Force required to invoke tenderness (highest value from 9 or 10 sites)
Notes
Data from the two stretch groups were pooled.
Stretched after exercise.
Soreness induced with 20 minutes of step-down exercise.
Risk of bias
Item
Authors’ judgement
Description
Allocation concealment?
Unclear
B - Unclear
Dawson 2005
Methods
Randomised (within subjects, cross-over)design
Participants
17 semi-professional Australian rules football players.
Interventions
Stretch condition: 15 minutes of supervised gentle stretching of the legs and back. This involved “2-3
repeats of 30 s held stretches across several muscle groups and joints”.
Control condition: no recovery procedures.
Outcomes
The following measures were taken at 15 and 48 hours:
1. Soreness (“how do you feel today?”) measured on a 1-7 scale anchored at “very, very good” and “very,
very sore”.
12
Dawson 2005
(Continued)
Notes
Part of a larger study in which each subject was exposed on 12 occasions, each separated by one week, to
one of four interventions (control, stretch, pool walking, hot-cold regimen).
Initial condition was allocated randomly and subsequent conditions by alternation. Stretch was before
exercise.
Soreness was induced by playing football.
All participants underwent a “pool recovery program” on the two days following exercise.
Risk of bias
Item
Description
Unclear
B - Unclear
Gulick 1996
Methods
Participants
~22 untrained participants randomly drawn from a larger sample of 35 men and 38 women aged 21 to
40 years.
Interventions
Stretch group (N ~ 11): static stretches of the wrist extensor muscles to “end range” for 10 minutes soon
after exercise.
Control group (N ~ 11): two placebo tablets after exercise and one further placebo tablet 24 hours later.
Outcomes
The following measures were taken 24, 48 and 72 hours after exercise:
1. On a 10 cm visual analogue scale anchored at “no soreness at all” and “soreness as bad as it could be”
2. Force required to invoke tenderness (sum of measures at 9 sites).
Notes
Part of a larger study in which subjects were randomised to seven interventions (control, stretch, antiinflammatory drug, intense exercise, ice massage, herbal ointment).
Stretch was after exercise.
Soreness was induced by 15 sets of 15 isokinetic eccentric wrist extensor contractions.
Risk of bias
Item
Description
Unclear
B - Unclear
High 1989
Methods
Participants
30 untrained participants (stretch group, 7 men and 7 women, mean age 19.4 years; control group, 8
men and 8 women, mean age 20.4 years)
13
High 1989
(Continued)
Interventions
Stretch group: stretched the quadriceps muscles (5 stretches, twice each on each leg for 50 seconds) of
both legs prior to exercise. Control group: did not stretch.
Outcomes
The following measures were taken at 24 hour intervals for 5 days:
1. Soreness on a 0-6 scale (0 = “absence of soreness”, 6 = “most severe pain possible from soreness”.
Notes
Part of a larger study in which subjects were randomised to four interventions (control, stretch, warm-up,
warm-up and stretch).
Stretch was before exercise.
Soreness was induced by 80 up to minutes of step-down exercise.
Risk of bias
Item
Description
Unclear
B - Unclear
Johansson 1999
Methods
Randomised (between legs) design
Participants
10 health female physiotherapy students (24 (SD 3) years, 62 (SD 3) kg, 169 (SD 5) cm), not taking antiinflammatory medication, with no lower limb musculoskeletal symptoms, had not participated in weight
training for more than 3 hours per week, and were not experiencing DOMS at the time of testing.
Interventions
Stretch leg: four 20-second “hamstring hurdle” stretches, with 20 second rests between stretches. Stretch
intensity was sufficient to induce a strong feeling of stretch but not pain.
Control leg: the hamstring muscles on the other side of the body were not stretched.
Outcomes
1. Hamstring muscle soreness on a 100 mm visual analogue scale (0 = “no soreness”, 100 = “worst possible
soreness”).
2. Tenderness measured with an algometer at 5 locations.
Notes
Stretch was before exercise.
Soreness was induced with 10 sets of 10 isokinetic knee flexor contractions.
Risk of bias
Item
Description
Unclear
B - Unclear
14
Maxwell 1988
Methods
Participants
~14 female first-year college students randomly drawn from a larger sample of 20 participants aged 1825 years (mean 19.4) and with varying histories of physical activity
Interventions
Stretch group (N ~ 6): participants lay in prone with the hips in a neutral position. A static stretch was
applied to the quadriceps to the point of discomfort and held for 15 seconds.
Control group (N ~ 6): participants did not stretch.
Outcomes
1. Pain experienced in the quadriceps muscle whilst descending a flight of stairs rated on a 10 cm visual
analogue scale anchored at 0 (“no pain”) and 10 (“severe pain”).
Notes
Stretch was after exercise.
Soreness was induced with 15 minutes of step-down exercise.
Risk of bias
Item
Description
Unclear
B - Unclear
McGlynn 1979
Methods
Participants
24 men aged 18-26 years who had not engaged in any systematic activity with their non-dominant arms
in the past 30 days
Interventions
Stretch group (N ~ 12): participants “hyperextended” the “forearm” for two minutes before releasing the
stretch for one minute. This sequence was repeated four times each at 6, 25, 30, 49 and 54 hours after
exercise.
Control group (N ~12): participants did not stretch.
Outcomes
1. Pain rated on a 30-point scale, where 0-10 was “mild” pain, 11-20 was “moderate” pain, and 21-30
was “severe” pain.
Notes
Part of a larger study in which subjects were randomised to three interventions (control, stretch, biofeedback).
Strech was after exercise.
Soreness was induced with eccentric elbow flexor exercises perfomed to failure.
Risk of bias
Item
Description
15
McGlynn 1979
(Continued)
Unclear
B - Unclear
Terry 1985
Methods
Participants
30 healthy participants, 15 men and 15 women, aged between 18 and 35 years. The modal level of exercise
was three times per week.
Interventions
Stretch (hold-relax)leg: participants lay prone with a wedge under the thigh and the knee was actively
flexed as far as possible. Then the participant performed an isometric contraction against manual resistance
for 4-6 seconds. This was followed by two seconds of relaxation and then an 8 second passive stretch. This
was repeated 5 times.
Control leg: the other leg was not stretched.
Outcomes
The following measures were taken on days 2 and 3:
1. “Pain” (or, variably, “pain, soreness or stiffness” or “soreness”)experienced that day during functional
movements rated on a scale of 0 (“no pain”) to 7 (“unbearably painful”).
Notes
Stretching was after exercise.
Soreness was induced with 20 minutes of step-up and step-down exercise.
Risk of bias
Item
Description
Unclear
B - Unclear
Wessel 1994a
Methods
Participants
10 sedentary adults (5 women and 5 men) with no history of back or lower limb injury or disease, mean
age 24.2 (SD 5.37), mean height 175.5 cm (SD 7.6) and mean mass 66.3 kg (8.9)
Interventions
Stretch leg: ten 60-second stretches of the hamstring muscles prior to exercise.
Control leg: not stretched.
Outcomes
The following measures were taken at 12 hour intervals to 72 hours after exercise:
1. Pain on a 10 cm visual analogue scale while walking on an “even” surface.
Notes
Randomisation was by toss of a coin.
Stretching was before exercise.
Soreness was induced by 3 sets of 20 isokinetic eccentric knee flexor contractions.
Risk of bias
16
Wessel 1994a
(Continued)
Item
Description
Unclear
B - Unclear
Wessel 1994b
Methods
Participants
10 sedentary adults (2 women and 8 men) with no history of back or lower limb injury or disease, mean
age 25.2 (SD 3.36), mean height 171.2 cm (SD 7.29) and mean mass 61.3 kg (7.12)
Interventions
Stretch leg: ten 60-second stretches of the hamstring muscles after exercise.
Control leg: not stretched.
Outcomes
The following measures were taken at 12 hour intervals to 72 hours after exercise:
1. Pain on a 10 cm visual analogue scale while walking on an “even” surface.
Notes
Randomisation was by toss of a coin.
Stretching was after exercise.
Soreness was induced by 3 sets of 20 isokinetic eccentric knee flexor contractions.
Risk of bias
Item
Description
Unclear
B - Unclear
DOMS: delayed onset muscle soreness
s: second
SD: standard deviation
~: approximately
The numbers of participants given in the Participants and Interventions columns are the number of participants who received the
stretch or control intervention. Thus, in studies with three or more arms the number of participants given is less than the total number
of participants in the study. In Gulick 1996, 73 participants were randomised to 7 groups but the size of each group is not given, so
sample sizes are given as “~11”. In Maxwell 1988, 20 participants were randomised to 3 groups but the size of each group is not given,
so sample sizes are given as “~6”. In McGlynn 1979, 36 participants were randomised to 3 groups but the size of each group is not
given, so sample sizes are given as “~12”.
17
Characteristics of excluded studies [ordered by study ID]
Abraham 1977
Stretching occurred 24 hours after exercise; not clear if conditions (stretch/flex) were randomised.
Bale 1991
No stretch condition.
de Vries 1966
Not randomised (compared before and after stretch).
Jayaraman 2004
Stretching did not begin until 36 hours after exercise.
Lightfoot 1997
Plantarflexors were sore, but quadriceps and hamstring muscles were stretched (fails criterion of “stretching
technique designed to prevent or treat delayed-onset muscle soreness”).
Lund 1998
Not randomised (used historical controls).
Prentice 1982
Stretch combined with heat or cold; interventions 24 hours after exercise; did not measure soreness.
Rahnama 2005
Stretching combined with jogging and walking and submaximal concentric contractions.
Reilly 2002
Not clearly randomised; stretch combined with jogging and “shake-down”.
Rodenburg 1994
Intervention was a combination of warm-up, stretch and massage.
Smith 1993
Compares effects of static and ballistic stretching.
18
DATA AND ANALYSES
Comparison 1. Pre-exercise stretching
Outcome or subgroup title
1 Soreness on day 0.5 (assessed 617 hours post-exercise)
2 Soreness on day 1 (assessed 1829 hours post-exercise)
No. of
studies
No. of
participants
1
3
70
1
2
40
1
2
40
Statistical method
Effect size
Mean Difference (IV, Fixed, 95% CI)
Totals not selected
-0.52 [-11.30,
10.26]
Totals not selected
0.72 [-11.20, 12.64]
Totals not selected
-2.50 [-15.82,
10.82]
Comparison 2. Post-exercise stretching
No. of
studies
No. of
participants
2
51
-0.47 [-6.05, 5.12]
4
127
-1.04 [-6.88, 4.79]
Totals not selected
1.12 [-4.63, 6.87]
Totals not selected
-0.03 [-7.49, 7.43]
1
5
158
1
3
67
Statistical method
Effect size
19
Comparison 3. Either pre- or post-exercise stretching
No. of
studies
No. of
participants
3
71
-0.97 [-6.24, 4.29]
7
197
-0.93 [-6.05, 4.20]
2
40
1.0 [-14.37, 16.37]
7
198
1.04 [-4.14, 6.22]
2
40
5
107
-0.75 [-13.25,
11.76]
-0.28 [-6.79, 6.22]
Statistical method
Effect size
Analysis 1.1. Comparison 1 Pre-exercise stretching, Outcome 1 Soreness on day 0.5 (assessed 6-17 hours
post-exercise).
Review:
Comparison: 1 Pre-exercise stretching
Outcome: 1 Soreness on day 0.5 (assessed 6-17 hours post-exercise)
Study or subgroup
Wessel 1994a
Stretch
Control
Mean Difference
N
Mean(SD)
N
Mean(SD)
10
4 (5)
10
9 (25)
IV,Fixed,95% CI
-5.00 [ -20.80, 10.80 ]
-100
-50
Favours stretch
Mean Difference
IV,Fixed,95% CI
0
50
100
Favours control
20
Analysis 1.2. Comparison 1 Pre-exercise stretching, Outcome 2 Soreness on day 1 (assessed 18-29 hours
post-exercise).
Review:
Outcome: 2 Soreness on day 1 (assessed 18-29 hours post-exercise)
Study or subgroup
Stretch
Control
Mean Difference
Weight
Mean Difference
N
Mean(SD)
N
Mean(SD)
IV,Fixed,95% CI
IV,Fixed,95% CI
High 1989
14
63.2 (37.5)
16
66.9 (37.5)
16.1 %
-3.70 [ -30.60, 23.20 ]
Johansson 1999
10
52.2 (15.5)
10
51 (15.8)
61.7 %
1.20 [ -12.52, 14.92 ]
Wessel 1994a
10
14 (24)
10
17 (28)
22.2 %
-3.00 [ -25.86, 19.86 ]
Total (95% CI)
34
100.0 %
-0.52 [ -11.30, 10.26 ]
36
Heterogeneity: Chi2 = 0.16, df = 2 (P = 0.92); I2 =0.0%
Test for overall effect: Z = 0.09 (P = 0.92)
-100
-50
0
Favours stretch
50
100
Favours control
post-exercise).
Review:
Study or subgroup
Wessel 1994a
Stretch
Control
Mean Difference
N
Mean(SD)
N
Mean(SD)
10
27 (30)
10
26 (28)
IV,Fixed,95% CI
1.00 [ -24.43, 26.43 ]
-100
-50
Favours stretch
Mean Difference
IV,Fixed,95% CI
0
50
100
Favours control
21
post-exercise).
Review:
Study or subgroup
Stretch
Control
Mean Difference
Weight
IV,Fixed,95% CI
Mean Difference
N
Mean(SD)
N
Mean(SD)
IV,Fixed,95% CI
Johansson 1999
10
70.5 (16.6)
10
69.4 (17.1)
65.1 %
1.10 [ -13.67, 15.87 ]
Wessel 1994a
10
32 (24)
10
32 (22)
34.9 %
0.0 [ -20.18, 20.18 ]
Total (95% CI)
20
100.0 %
0.72 [ -11.20, 12.64 ]
20
-100
-50
Favours stretch
0
50
100
Favours control
post-exercise).
Review:
Study or subgroup
Wessel 1994a
Stretch
Control
Mean Difference
N
Mean(SD)
N
Mean(SD)
10
27 (19)
10
29 (27)
IV,Fixed,95% CI
-2.00 [ -22.46, 18.46 ]
-100
-50
Favours stretch
Mean Difference
IV,Fixed,95% CI
0
50
100
Favours control
22
post-exercise).
Review:
Study or subgroup
Stretch
Control
Mean Difference
Weight
IV,Fixed,95% CI
Mean Difference
N
Mean(SD)
N
Mean(SD)
IV,Fixed,95% CI
Johansson 1999
10
39.4 (23.8)
10
38.2 (24.1)
40.3 %
1.20 [ -19.79, 22.19 ]
Wessel 1994a
10
13 (17)
10
18 (22)
59.7 %
-5.00 [ -22.23, 12.23 ]
Total (95% CI)
20
100.0 %
-2.50 [ -15.82, 10.82 ]
20
-100
-50
0
50
Favours stretch
100
Favours control
Analysis 2.1. Comparison 2 Post-exercise stretching, Outcome 1 Soreness on day 0.5 (assessed 6-17 hours
post-exercise).
Review:
Comparison: 2 Post-exercise stretching
Study or subgroup
Stretch
Control
Mean Difference
Weight
Mean Difference
57 (10)
30.6 %
3.00 [ -7.09, 13.09 ]
6 (9)
69.4 %
-2.00 [ -8.70, 4.70 ]
100.0 %
-0.47 [ -6.05, 5.12 ]
N
Mean(SD)
N
Mean(SD)
Dawson 2005
14
60 (17)
17
Wessel 1994b
10
4 (6)
10
Total (95% CI)
24
IV,Fixed,95% CI
IV,Fixed,95% CI
27
-100
-50
Favours stretch
0
50
100
Favours control
23
Analysis 2.2. Comparison 2 Post-exercise stretching, Outcome 2 Soreness on day 1 (assessed 18-29 hours
post-exercise).
Review:
Study or subgroup
Stretch
Control
Mean Difference
Weight
IV,Fixed,95% CI
Mean Difference
N
Mean(SD)
N
Mean(SD)
IV,Fixed,95% CI
Buroker 1989
15
61.7 (15.1)
8
64 (5.9)
45.3 %
-2.30 [ -10.97, 6.37 ]
McGlynn 1979
12
29.4 (20)
12
27.5 (19.5)
13.6 %
1.90 [ -13.90, 17.70 ]
Terry 1985
30
23.6 (23.6)
30
24.7 (23.6)
23.8 %
-1.10 [ -13.04, 10.84 ]
Wessel 1994b
10
13 (16)
10
13 (16)
17.3 %
0.0 [ -14.02, 14.02 ]
Total (95% CI)
67
100.0 %
-1.04 [ -6.88, 4.79 ]
60
-100
-50
0
Favours stretch
50
100
Favours control
post-exercise).
Review:
Study or subgroup
Wessel 1994b
Stretch
Control
Mean Difference
N
Mean(SD)
N
Mean(SD)
10
22 (22)
10
21 (22)
IV,Fixed,95% CI
1.00 [ -18.28, 20.28 ]
-100
-50
Favours stretch
Mean Difference
IV,Fixed,95% CI
0
50
100
Favours control
24
post-exercise).
Review:
Study or subgroup
Stretch
Control
Mean Difference
Weight
IV,Fixed,95% CI
Mean Difference
N
Mean(SD)
N
Mean(SD)
IV,Fixed,95% CI
Buroker 1989
15
65.2 (23.1)
8
68.5 (13.7)
14.6 %
-3.30 [ -18.36, 11.76 ]
Dawson 2005
14
52 (15)
17
48 (10)
39.2 %
4.00 [ -5.18, 13.18 ]
McGlynn 1979
12
29.4 (20)
12
27.2 (20.4)
12.7 %
2.20 [ -13.96, 18.36 ]
Terry 1985
30
23.3 (24)
30
24.4 (24)
22.4 %
-1.10 [ -13.25, 11.05 ]
Wessel 1994b
10
26 (22)
10
26 (17)
11.1 %
0.0 [ -17.23, 17.23 ]
Total (95% CI)
81
100.0 %
1.12 [ -4.63, 6.87 ]
77
-100
-50
0
Favours stretch
50
100
Favours control
post-exercise).
Review:
Study or subgroup
Wessel 1994b
Stretch
Control
Mean Difference
N
Mean(SD)
N
Mean(SD)
10
24 (19)
10
24 (17)
IV,Fixed,95% CI
0.0 [ -15.80, 15.80 ]
-100
-50
Favours stretch
Mean Difference
IV,Fixed,95% CI
0
50
100
Favours control
25
post-exercise).
Review:
Study or subgroup
Stretch
Control
Mean Difference
Weight
IV,Fixed,95% CI
Mean Difference
N
Mean(SD)
N
Mean(SD)
IV,Fixed,95% CI
Buroker 1989
15
51.8 (20.2)
8
50.3 (26.5)
12.6 %
1.50 [ -19.52, 22.52 ]
McGlynn 1979
12
11.4 (9.9)
12
11.4 (13)
65.1 %
0.0 [ -9.25, 9.25 ]
Wessel 1994b
10
17 (17)
10
18 (19)
22.3 %
-1.00 [ -16.80, 14.80 ]
Total (95% CI)
37
100.0 %
-0.03 [ -7.49, 7.43 ]
30
-100
-50
0
Favours stretch
50
100
Favours control
Analysis 3.1. Comparison 3 Either pre- or post-exercise stretching, Outcome 1 Soreness on day 0.5
(assessed 6-17 hours post-exercise).
Review:
Comparison: 3 Either pre- or post-exercise stretching
Study or subgroup
Stretch
Control
Mean Difference
Weight
Mean Difference
9 (25)
11.1 %
-5.00 [ -20.80, 10.80 ]
17
57 (10)
27.2 %
3.00 [ -7.09, 13.09 ]
10
6 (9)
61.7 %
-2.00 [ -8.70, 4.70 ]
100.0 %
-0.97 [ -6.24, 4.29 ]
N
Mean(SD)
N
Mean(SD)
Wessel 1994a
10
4 (5)
10
Dawson 2005
14
60 (17)
Wessel 1994b
10
4 (6)
Total (95% CI)
34
IV,Fixed,95% CI
IV,Fixed,95% CI
37
-100
-50
Favours stretch
0
50
100
Favours control
26
Analysis 3.2. Comparison 3 Either pre- or post-exercise stretching, Outcome 2 Soreness on day 1 (assessed
18-29 hours post-exercise).
Review:
Study or subgroup
Stretch
Control
Mean Difference
Weight
Mean Difference
64 (5.9)
35.0 %
-2.30 [ -10.97, 6.37 ]
16
66.9 (37.5)
3.6 %
-3.70 [ -30.60, 23.20 ]
52.2 (15.5)
10
51 (15.8)
14.0 %
1.20 [ -12.52, 14.92 ]
12
29.4 (20)
12
27.5 (19.5)
10.5 %
1.90 [ -13.90, 17.70 ]
Terry 1985
30
23.6 (23.6)
30
24.7 (23.6)
18.4 %
-1.10 [ -13.04, 10.84 ]
Wessel 1994a
10
14 (24)
10
17 (28)
5.0 %
-3.00 [ -25.86, 19.86 ]
Wessel 1994b
10
13 (16)
10
13 (16)
13.4 %
0.0 [ -14.02, 14.02 ]
100.0 %
-0.93 [ -6.05, 4.20 ]
N
Mean(SD)
N
Mean(SD)
Buroker 1989
15
61.7 (15.1)
8
High 1989
14
63.2 (37.5)
Johansson 1999
10
McGlynn 1979
Total (95% CI)
101
IV,Fixed,95% CI
IV,Fixed,95% CI
96
-100
-50
0
Favours stretch
50
100
Favours control
Review:
Study or subgroup
Stretch
Control
Mean Difference
Weight
Mean Difference
26 (28)
36.5 %
1.00 [ -24.43, 26.43 ]
21 (22)
63.5 %
1.00 [ -18.28, 20.28 ]
100.0 %
1.00 [ -14.37, 16.37 ]
N
Mean(SD)
N
Mean(SD)
Wessel 1994a
10
27 (30)
10
Wessel 1994b
10
22 (22)
10
Total (95% CI)
20
IV,Fixed,95% CI
IV,Fixed,95% CI
20
-100
-50
Favours stretch
0
50
100
Favours control
27
Review:
Study or subgroup
Stretch
Control
Mean Difference
Weight
Mean(SD)
N
Mean(SD)
Buroker 1989
15
65.2 (23.1)
8
68.5 (13.7)
11.8 %
-3.30 [ -18.36, 11.76 ]
Johansson 1999
10
70.5 (16.6)
10
69.4 (17.1)
12.3 %
1.10 [ -13.67, 15.87 ]
Dawson 2005
14
52 (15)
17
48 (10)
31.8 %
4.00 [ -5.18, 13.18 ]
McGlynn 1979
12
29.4 (20)
12
27.2 (20.4)
10.3 %
2.20 [ -13.96, 18.36 ]
Terry 1985
30
23.3 (24)
30
24.4 (24)
18.2 %
-1.10 [ -13.25, 11.05 ]
Wessel 1994a
10
32 (24)
10
32 (22)
6.6 %
0.0 [ -20.18, 20.18 ]
Wessel 1994b
10
26 (22)
10
26 (17)
9.0 %
0.0 [ -17.23, 17.23 ]
100.0 %
1.04 [ -4.14, 6.22 ]
Total (95% CI)
101
IV,Fixed,95% CI
Mean Difference
N
IV,Fixed,95% CI
97
-100
-50
Favours stretch
0
50
100
Favours control
28
Review:
Study or subgroup
Stretch
Control
Mean Difference
Weight
IV,Fixed,95% CI
Mean Difference
N
Mean(SD)
N
Mean(SD)
IV,Fixed,95% CI
Wessel 1994a
10
27 (19)
10
29 (27)
37.4 %
-2.00 [ -22.46, 18.46 ]
Wessel 1994b
10
24 (19)
10
24 (17)
62.6 %
0.0 [ -15.80, 15.80 ]
Total (95% CI)
20
100.0 %
-0.75 [ -13.25, 11.76 ]
20
-100
-50
0
50
Favours stretch
100
Favours control
Review:
Study or subgroup
Stretch
Control
Mean Difference
Weight
Mean Difference
50.3 (26.5)
9.6 %
1.50 [ -19.52, 22.52 ]
10
38.2 (24.1)
9.6 %
1.20 [ -19.79, 22.19 ]
11.4 (9.9)
12
11.4 (13)
49.6 %
0.0 [ -9.25, 9.25 ]
10
13 (17)
10
18 (22)
14.3 %
-5.00 [ -22.23, 12.23 ]
Wessel 1994b
10
18 (19)
10
17 (17)
17.0 %
1.00 [ -14.80, 16.80 ]
Total (95% CI)
57
100.0 %
-0.28 [ -6.79, 6.22 ]
N
Mean(SD)
N
Mean(SD)
Buroker 1989
15
51.8 (20.2)
8
Johansson 1999
10
39.4 (23.8)
McGlynn 1979
12
Wessel 1994a
IV,Fixed,95% CI
IV,Fixed,95% CI
50
-100
-50
Favours stretch
0
50
100
Favours control
29
APPENDICES
Appendix 1. Search strategy for The Cochrane Library
#1. stretch*
#2. flexib*
#3. PLIABILITY single term (MeSH)
#4. (range near motion)
#5. (((((warmup or warm-up) or (warm next down)) or cooldown) or cool-down) or (cool next down))
#6. ((((#1 or #2) or #3) or #4) or #5)
#7. (sore* near musc*)
#8. doms
#9. MUSCLE CONTRACTION explode all trees (MeSH)
#10. MUSCLE SKELETAL single term (MeSH)
#11. (#9 or #10)
#12. PAIN single term (MeSH)
#13. (#11 and #12)
#14. (#7 or #8 or #13)
#15. (#6 and #14)
Appendix 2. Search strategy for MEDLINE (OVID WEB)
1. stretch$.tw.
2. flexib$.tw.
3. Pliability/
4. (range adj3 motion).tw.
5. Range of Motion, Articular/
6. (warmup or warm-up or warm up or cooldown or cool-down or cool down).tw.
7. or/1-6
8. (sore$ adj3 musc$).tw.
9. DOMS.tw.
10. exp Muscle Contraction/
11. Muscle, Skeletal/
12. or/10-11
13. Pain/
14. and/12-13
15. or/8-9,14
16. and/7,15
Appendix 3. Search strategy for PEDro
The first search combined the following terms with “OR”:
[in the Title/abstract field] stretch*, sore*, DOMS
The second search combined the following terms with “AND”:
[in the Therapy field] stretching, mobilisation, manipulation, massage
[in the Subdiscipline field] sports
[in the Problem field] pain
30
WHAT’S NEW
Last assessed as up-to-date: 15 August 2006.
4 July 2008
Amended
Converted to new review format.
HISTORY
Protocol first published: Issue 1, 2004
Review first published: Issue 4, 2007
CONTRIBUTIONS OF AUTHORS
The protocol for the current review was written by Rob Herbert. Data collection, data extraction, quality assessment, data analysis
and interpretation were carried out by Marcos de Noronha and Rob Herbert. Rob Herbert wrote the review. Marcos de Noronha
commented on drafts and approved the final version. Rob Herbert is guarantor for the review.
DECLARATIONS OF INTEREST
None known.
SOURCES OF SUPPORT
Internal sources
• University of Sydney, Australia.
External sources
• No sources of support supplied
INDEX TERMS
31
Medical Subject Headings (MeSH)
∗ Exercise;
Muscle Stretching Exercises; Muscular Diseases [∗ prevention & control]; Pain [∗ prevention & control]; Randomized Controlled Trials as Topic
MeSH check words
Humans
32

Manuale corso base Evidence Based Practice

Transcript

Documenti analoghi

stampa slides

visualizza slides

Dott G Mascherini_130124093346

L`effetto dello stretching statico sulla capacità di

stretching - Roberto Sassi

FILATRICE CONTINUA mod. FTA 10 CONTINUOUS STRETCHER

STRETCHING : LA VERITA` Di Gretchen Reynolds Da: NY TIMES

Lesson 5th November

2-stretch-e-performance-sportiva

Spinal reflexes