FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes

High-impact journals are promoting transparency of data. Modern scientific methods can be automated and produce disparate sample sizes. In many cases, it is desirable to retain identical or pre-defined sample sizes between replicates or groups. However, choosing which subset of originally acquired...

Bibliographic Details
Published in:	Journal of biological methods 2019-09, Vol.6 (3), p.1-e118
Main Authors:	K Ortell, Katherine; M Switonski, Pawel; Delaney, Joe Ryan
Format:	Article
Language:	English
Subjects:	Resource

cited_by	cdi_FETCH-LOGICAL-c2099-ab292bf6e1b44a3b1fda5df23e51ceb73c5b0e393f21cf241cc467d1ad7ba9153
cites
container_end_page	e118
container_issue	3
container_start_page	1
container_title	Journal of biological methods
container_volume	6
creator	K Ortell, Katherine ; M Switonski, Pawel ; Delaney, Joe Ryan
description	High-impact journals are promoting transparency of data. Modern scientific methods can be automated and produce disparate sample sizes. In many cases, it is desirable to retain identical or pre-defined sample sizes between replicates or groups. However, choosing the subset of originally acquired data that best matches the entirety of the data set without introducing bias is not trivial. Here, we released a free online tool, FairSubset, and its constituent Shiny App R code to subset data in an unbiased fashion. Subsets were set at the same N across samples and retained representative average and standard deviation information. The method can be used for quantitation of entire fields of view or other replicates without biasing the data pool toward large N samples. We showed examples of the tool's use with fluorescence data and DNA damage-related Comet tail quantitation. This FairSubset tool and the method to retain distribution information at the single-datum level may be considered for standardized use in fair publishing practices.
doi_str_mv	10.14440/jbm.2019.299
format	article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6761370</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2301445839</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2099-ab292bf6e1b44a3b1fda5df23e51ceb73c5b0e393f21cf241cc467d1ad7ba9153</originalsourceid><addsrcrecordid>eNpVkUtLAzEURoMotqhL91m6mZrHPIwLQcSqILhQ1yHJ3NiUmWZMMhb99aatiG5yA9_huyEHoVNKZrQsS3K-1P2MESpmTIg9NGWc1YUQhO7_uU_QSYxLQgitRE0afogmnFYXOeZTtJ4rF55HHSFd4mucvO_ygc3C-wg4wBAgwiqp5D4Axy0Xsbe4VUlh6wMeM7Z2abFhO2dUgpwH_Bb8OOxIZy2E3IGj6ocut7gviMfowKouwsnPPEKv89uXm_vi8enu4eb6sTCMCFEozQTTtgaqy1JxTW2rqtYyDhU1oBtuKk2AC24ZNZaV1Jiyblqq2kYrQSt-hK52vcOoe2hNfkdQnRyC61X4lF45-T9ZuYV88x-ybmrKG5ILzn4Kgn8fISbZu2ig69QK_Bgl4ySryN8pMlrsUBN8jAHs7xpK5NaXzL7kxpfMvvg3DBSKug</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2301445839</pqid></control><display><type>article</type><title>FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes</title><source>PubMed Central</source><creator>K Ortell, Katherine ; M Switonski, Pawel ; Delaney, Joe Ryan</creator><creatorcontrib>K Ortell, Katherine ; M Switonski, Pawel ; Delaney, Joe Ryan</creatorcontrib><description>High-impact journals are promoting transparency of data. Modern scientific methods can be automated and produce disparate samples sizes. In many cases, it is desirable to retain identical or pre-defined sample sizes between replicates or groups. However, choosing which subset of originally acquired data that best matches the entirety of the data set without introducing bias is not trivial. Here, we released a free online tool, FairSubset, and its constituent Shiny App R code to subset data in an unbiased fashion. Subsets were set at the same N across samples and retained representative average and standard deviation information. 
The method can be used for quantitation of entire fields of view or other replicates without biasing the data pool toward large N samples. We showed examples of the tool's use with fluorescence data and DNA-damage related Comet tail quantitation. This FairSubset tool and the method to retain distribution information at the single-datum level may be considered for standardized use in fair publishing practices.</description><identifier>ISSN: 2326-9901</identifier><identifier>EISSN: 2326-9901</identifier><identifier>DOI: 10.14440/jbm.2019.299</identifier><identifier>PMID: 31583263</identifier><language>eng</language><publisher>Journal of Biological Methods</publisher><subject>Resource</subject><ispartof>Journal of biological methods, 2019-09, Vol.6 (3), p.1-e118</ispartof><rights>2013-2019 The Journal of Biological Methods, All rights reserved. 2019</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c2099-ab292bf6e1b44a3b1fda5df23e51ceb73c5b0e393f21cf241cc467d1ad7ba9153</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6761370/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6761370/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,315,733,786,790,891,27957,27958,53827,53829</link.rule.ids></links><search><creatorcontrib>K Ortell, Katherine</creatorcontrib><creatorcontrib>M Switonski, Pawel</creatorcontrib><creatorcontrib>Delaney, Joe Ryan</creatorcontrib><title>FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes</title><title>Journal of biological methods</title><description>High-impact journals are promoting transparency of data. 
Modern scientific methods can be automated and produce disparate samples sizes. In many cases, it is desirable to retain identical or pre-defined sample sizes between replicates or groups. However, choosing which subset of originally acquired data that best matches the entirety of the data set without introducing bias is not trivial. Here, we released a free online tool, FairSubset, and its constituent Shiny App R code to subset data in an unbiased fashion. Subsets were set at the same N across samples and retained representative average and standard deviation information. The method can be used for quantitation of entire fields of view or other replicates without biasing the data pool toward large N samples. We showed examples of the tool's use with fluorescence data and DNA-damage related Comet tail quantitation. This FairSubset tool and the method to retain distribution information at the single-datum level may be considered for standardized use in fair publishing practices.</description><subject>Resource</subject><issn>2326-9901</issn><issn>2326-9901</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNpVkUtLAzEURoMotqhL91m6mZrHPIwLQcSqILhQ1yHJ3NiUmWZMMhb99aatiG5yA9_huyEHoVNKZrQsS3K-1P2MESpmTIg9NGWc1YUQhO7_uU_QSYxLQgitRE0afogmnFYXOeZTtJ4rF55HHSFd4mucvO_ygc3C-wg4wBAgwiqp5D4Axy0Xsbe4VUlh6wMeM7Z2abFhO2dUgpwH_Bb8OOxIZy2E3IGj6ocut7gviMfowKouwsnPPEKv89uXm_vi8enu4eb6sTCMCFEozQTTtgaqy1JxTW2rqtYyDhU1oBtuKk2AC24ZNZaV1Jiyblqq2kYrQSt-hK52vcOoe2hNfkdQnRyC61X4lF45-T9ZuYV88x-ybmrKG5ILzn4Kgn8fISbZu2ig69QK_Bgl4ySryN8pMlrsUBN8jAHs7xpK5NaXzL7kxpfMvvg3DBSKug</recordid><startdate>20190903</startdate><enddate>20190903</enddate><creator>K Ortell, Katherine</creator><creator>M Switonski, Pawel</creator><creator>Delaney, Joe Ryan</creator><general>Journal of Biological 
Methods</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20190903</creationdate><title>FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes</title><author>K Ortell, Katherine ; M Switonski, Pawel ; Delaney, Joe Ryan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2099-ab292bf6e1b44a3b1fda5df23e51ceb73c5b0e393f21cf241cc467d1ad7ba9153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Resource</topic><toplevel>online_resources</toplevel><creatorcontrib>K Ortell, Katherine</creatorcontrib><creatorcontrib>M Switonski, Pawel</creatorcontrib><creatorcontrib>Delaney, Joe Ryan</creatorcontrib><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of biological methods</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>K Ortell, Katherine</au><au>M Switonski, Pawel</au><au>Delaney, Joe Ryan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes</atitle><jtitle>Journal of biological methods</jtitle><date>2019-09-03</date><risdate>2019</risdate><volume>6</volume><issue>3</issue><spage>1</spage><epage>e118</epage><pages>1-e118</pages><issn>2326-9901</issn><eissn>2326-9901</eissn><notes>ObjectType-Article-1</notes><notes>SourceType-Scholarly Journals-1</notes><notes>ObjectType-Feature-2</notes><notes>content type line 23</notes><notes>These authors contributed equally to the work</notes><notes>Competing interests: The authors have declared that no competing interests exist.</notes><abstract>High-impact journals 
are promoting transparency of data. Modern scientific methods can be automated and produce disparate samples sizes. In many cases, it is desirable to retain identical or pre-defined sample sizes between replicates or groups. However, choosing which subset of originally acquired data that best matches the entirety of the data set without introducing bias is not trivial. Here, we released a free online tool, FairSubset, and its constituent Shiny App R code to subset data in an unbiased fashion. Subsets were set at the same N across samples and retained representative average and standard deviation information. The method can be used for quantitation of entire fields of view or other replicates without biasing the data pool toward large N samples. We showed examples of the tool's use with fluorescence data and DNA-damage related Comet tail quantitation. This FairSubset tool and the method to retain distribution information at the single-datum level may be considered for standardized use in fair publishing practices.</abstract><pub>Journal of Biological Methods</pub><pmid>31583263</pmid><doi>10.14440/jbm.2019.299</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2326-9901
ispartof	Journal of biological methods, 2019-09, Vol.6 (3), p.1-e118
issn	2326-9901 2326-9901
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6761370
source	PubMed Central
subjects	Resource
title	FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-09-22T16%3A29%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=FairSubset:%20A%20tool%20to%20choose%20representative%20subsets%20of%20data%20for%20use%20with%20replicates%20or%20groups%20of%20different%20sample%20sizes&rft.jtitle=Journal%20of%20biological%20methods&rft.au=K%20Ortell,%20Katherine&rft.date=2019-09-03&rft.volume=6&rft.issue=3&rft.spage=1&rft.epage=e118&rft.pages=1-e118&rft.issn=2326-9901&rft.eissn=2326-9901&rft_id=info:doi/10.14440/jbm.2019.299&rft_dat=%3Cproquest_pubme%3E2301445839%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c2099-ab292bf6e1b44a3b1fda5df23e51ceb73c5b0e393f21cf241cc467d1ad7ba9153%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2301445839&rft_id=info:pmid/31583263&rfr_iscdi=true
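
The description field above states the goal of the tool: reduce every sample to the same N while keeping each subset's average and standard deviation close to those of the full sample. The record does not give the selection rule, so the following Python sketch is only an illustration under stated assumptions, not the published FairSubset code (which is distributed as a Shiny App in R). The function names `fair_subset` and `fair_subsets`, the random candidate search, the `trials` and `seed` parameters, and the scoring rule (sum of absolute differences in mean and SD) are all hypothetical.

```python
import random
import statistics


def fair_subset(values, n, trials=1000, seed=0):
    """Pick n items whose mean and SD are close to those of all values.

    Hypothetical helper: draws `trials` random candidate subsets and
    keeps the one with the smallest combined mean/SD difference.
    """
    if not 2 <= n <= len(values):
        raise ValueError("n must be between 2 and len(values)")
    rng = random.Random(seed)  # seeded so the choice is reproducible
    full_mean = statistics.fmean(values)
    full_sd = statistics.stdev(values)

    def score(subset):
        # Assumed scoring rule: absolute mean gap plus absolute SD gap.
        return (abs(statistics.fmean(subset) - full_mean)
                + abs(statistics.stdev(subset) - full_sd))

    best, best_score = None, float("inf")
    for _ in range(trials):
        candidate = rng.sample(values, n)
        candidate_score = score(candidate)
        if candidate_score < best_score:
            best, best_score = candidate, candidate_score
    return best


def fair_subsets(groups, trials=1000, seed=0):
    """Subset every group to the size of the smallest group."""
    n = min(len(v) for v in groups.values())
    return {name: fair_subset(list(v), n, trials, seed)
            for name, v in groups.items()}
```

Called as `fair_subsets({"field_1": [...], "field_2": [...]})`, this returns one equal-sized list per group. Fixing the seed keeps the selection reproducible, which matters if the chosen subset is to be reported alongside the full data.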