Double sampling with multiple imputation to answer large sample meta-research questions: Introduction and illustration by evaluating adherence to two simple CONSORT guidelines

BACKGROUND: Meta-research can involve manual retrieval and evaluation of research, which is resource intensive. Creation of high throughput methods (e.g., search heuristics, crowdsourcing) has improved feasibility of large meta-research questions, but possibly at the cost of accuracy. OBJECTIVE: To...

Full description

Bibliographic Details
Main Authors:	Patrice L. Capers, Andrew W Brown, John eDawson, David B Allison
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2015-03-01
Series:	Frontiers in Nutrition
Subjects:	modeling adherence CONSORT Double sampling Multiple imputation Meta-research
Online Access:	http://journal.frontiersin.org/Journal/10.3389/fnut.2015.00006/full

id	doaj-8d4802761f1743d1955d643053cf8ebb
record_format	Article
spelling	doaj-8d4802761f1743d1955d643053cf8ebb2020-11-25T01:06:27ZengFrontiers Media S.A.Frontiers in Nutrition2296-861X2015-03-01210.3389/fnut.2015.00006123291Double sampling with multiple imputation to answer large sample meta-research questions: Introduction and illustration by evaluating adherence to two simple CONSORT guidelinesPatrice L. Capers0Andrew W Brown1John eDawson2John eDawson3David B Allison4David B Allison5David B Allison6David B Allison7University of Alabama at BirminghamUniversity of Alabama at BirminghamUniversity of Alabama at BirminghamUniversity of Alabama at BirminghamUniversity of Alabama at BirminghamUniversity of Alabama at BirminghamUniversity of Alabama at BirminghamUniversity of Alabama at BirminghamBACKGROUND: Meta-research can involve manual retrieval and evaluation of research, which is resource intensive. Creation of high throughput methods (e.g., search heuristics, crowdsourcing) has improved feasibility of large meta-research questions, but possibly at the cost of accuracy. OBJECTIVE: To evaluate the use of double sampling combined with multiple imputation (DS+MI) to address meta-research questions, using as an example adherence of PubMed entries to two simple Consolidated Standards of Reporting Trials (CONSORT) guidelines for titles and abstracts. METHODS: For the DS large sample, we retrieved all PubMed entries satisfying the filters: RCT; human; abstract available; and English language (n=322,107). For the DS subsample, we randomly sampled 500 entries from the large sample. The large sample was evaluated with a lower rigor, higher throughput (RLOTHI) method using search heuristics, while the subsample was evaluated using a higher rigor, lower throughput (RHITLO) human rating method. Multiple imputation of the missing-completely-at-random RHITLO data for the large sample was informed by: RHITLO data from the subsample; RLOTHI data from the large sample; whether a study was an RCT; and country and year of publication. RESULTS: The RHITLO and RLOTHI methods in the subsample largely agreed (phi coefficients: title=1.00, abstract=0.92). Compliance with abstract and title criteria has increased over time, with non-US countries improving more rapidly. DS+MI logistic regression estimates were more precise than subsample estimates (e.g., 95% CI for change in title and abstract compliance by Year: subsample RHITLO 1.050-1.174 vs. DS+MI 1.082-1.151). As evidence of improved accuracy, DS+MI coefficient estimates were closer to RHITLO than the large sample RLOTHI. CONCLUSIONS: Our results support our hypothesis that DS+MI would result in improved precision and accuracy. This method is flexible and may provide a practical way to examine large corpora of literature.http://journal.frontiersin.org/Journal/10.3389/fnut.2015.00006/fullmodelingadherenceCONSORTDouble samplingMultiple imputationMeta-research
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Patrice L. Capers Andrew W Brown John eDawson John eDawson David B Allison David B Allison David B Allison David B Allison
spellingShingle	Patrice L. Capers Andrew W Brown John eDawson John eDawson David B Allison David B Allison David B Allison David B Allison Double sampling with multiple imputation to answer large sample meta-research questions: Introduction and illustration by evaluating adherence to two simple CONSORT guidelines Frontiers in Nutrition modeling adherence CONSORT Double sampling Multiple imputation Meta-research
author_facet	Patrice L. Capers Andrew W Brown John eDawson John eDawson David B Allison David B Allison David B Allison David B Allison
author_sort	Patrice L. Capers
title	Double sampling with multiple imputation to answer large sample meta-research questions: Introduction and illustration by evaluating adherence to two simple CONSORT guidelines
title_short	Double sampling with multiple imputation to answer large sample meta-research questions: Introduction and illustration by evaluating adherence to two simple CONSORT guidelines
title_full	Double sampling with multiple imputation to answer large sample meta-research questions: Introduction and illustration by evaluating adherence to two simple CONSORT guidelines
title_fullStr	Double sampling with multiple imputation to answer large sample meta-research questions: Introduction and illustration by evaluating adherence to two simple CONSORT guidelines
title_full_unstemmed	Double sampling with multiple imputation to answer large sample meta-research questions: Introduction and illustration by evaluating adherence to two simple CONSORT guidelines
title_sort	double sampling with multiple imputation to answer large sample meta-research questions: introduction and illustration by evaluating adherence to two simple consort guidelines
publisher	Frontiers Media S.A.
series	Frontiers in Nutrition
issn	2296-861X
publishDate	2015-03-01
description	BACKGROUND: Meta-research can involve manual retrieval and evaluation of research, which is resource intensive. Creation of high throughput methods (e.g., search heuristics, crowdsourcing) has improved feasibility of large meta-research questions, but possibly at the cost of accuracy. OBJECTIVE: To evaluate the use of double sampling combined with multiple imputation (DS+MI) to address meta-research questions, using as an example adherence of PubMed entries to two simple Consolidated Standards of Reporting Trials (CONSORT) guidelines for titles and abstracts. METHODS: For the DS large sample, we retrieved all PubMed entries satisfying the filters: RCT; human; abstract available; and English language (n=322,107). For the DS subsample, we randomly sampled 500 entries from the large sample. The large sample was evaluated with a lower rigor, higher throughput (RLOTHI) method using search heuristics, while the subsample was evaluated using a higher rigor, lower throughput (RHITLO) human rating method. Multiple imputation of the missing-completely-at-random RHITLO data for the large sample was informed by: RHITLO data from the subsample; RLOTHI data from the large sample; whether a study was an RCT; and country and year of publication. RESULTS: The RHITLO and RLOTHI methods in the subsample largely agreed (phi coefficients: title=1.00, abstract=0.92). Compliance with abstract and title criteria has increased over time, with non-US countries improving more rapidly. DS+MI logistic regression estimates were more precise than subsample estimates (e.g., 95% CI for change in title and abstract compliance by Year: subsample RHITLO 1.050-1.174 vs. DS+MI 1.082-1.151). As evidence of improved accuracy, DS+MI coefficient estimates were closer to RHITLO than the large sample RLOTHI. CONCLUSIONS: Our results support our hypothesis that DS+MI would result in improved precision and accuracy. This method is flexible and may provide a practical way to examine large corpora of literature.
topic	modeling adherence CONSORT Double sampling Multiple imputation Meta-research
url	http://journal.frontiersin.org/Journal/10.3389/fnut.2015.00006/full
work_keys_str_mv	AT patricelcapers doublesamplingwithmultipleimputationtoanswerlargesamplemetaresearchquestionsintroductionandillustrationbyevaluatingadherencetotwosimpleconsortguidelines AT andrewwbrown doublesamplingwithmultipleimputationtoanswerlargesamplemetaresearchquestionsintroductionandillustrationbyevaluatingadherencetotwosimpleconsortguidelines AT johnedawson doublesamplingwithmultipleimputationtoanswerlargesamplemetaresearchquestionsintroductionandillustrationbyevaluatingadherencetotwosimpleconsortguidelines AT johnedawson doublesamplingwithmultipleimputationtoanswerlargesamplemetaresearchquestionsintroductionandillustrationbyevaluatingadherencetotwosimpleconsortguidelines AT davidballison doublesamplingwithmultipleimputationtoanswerlargesamplemetaresearchquestionsintroductionandillustrationbyevaluatingadherencetotwosimpleconsortguidelines AT davidballison doublesamplingwithmultipleimputationtoanswerlargesamplemetaresearchquestionsintroductionandillustrationbyevaluatingadherencetotwosimpleconsortguidelines AT davidballison doublesamplingwithmultipleimputationtoanswerlargesamplemetaresearchquestionsintroductionandillustrationbyevaluatingadherencetotwosimpleconsortguidelines AT davidballison doublesamplingwithmultipleimputationtoanswerlargesamplemetaresearchquestionsintroductionandillustrationbyevaluatingadherencetotwosimpleconsortguidelines
_version_	1725190112974733312

Double sampling with multiple imputation to answer large sample meta-research questions: Introduction and illustration by evaluating adherence to two simple CONSORT guidelines

Similar Items