Classifying Oryza sativa accessions into Indica and Japonica using logistic regression model with phenotypic data

In Oryza sativa, indica and japonica are pivotal subpopulations, and other subpopulations such as aus and aromatic are considered to be derived from indica or japonica. In this regard, Oryza sativa accessions are frequently viewed from the indica/japonica perspective. This study introduces a computa...

Full description

Bibliographic Details
Main Author: Bongsong Kim
Format: Article
Language:English
Published: PeerJ Inc. 2019-11-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/7259.pdf
id doaj-ec74a377a6b4470aa044a092c6f97863
record_format Article
spelling doaj-ec74a377a6b4470aa044a092c6f978632020-11-25T02:19:49ZengPeerJ Inc.PeerJ2167-83592019-11-017e725910.7717/peerj.7259Classifying Oryza sativa accessions into Indica and Japonica using logistic regression model with phenotypic dataBongsong Kim0Noble Research Institute LLC, Ardmore, OK, Carter, United States of AmericaIn Oryza sativa, indica and japonica are pivotal subpopulations, and other subpopulations such as aus and aromatic are considered to be derived from indica or japonica. In this regard, Oryza sativa accessions are frequently viewed from the indica/japonica perspective. This study introduces a computational method for indica/japonica classification by applying phenotypic variables to the logistic regression model (LRM). The population used in this study included 413 Oryza sativa accessions, of which 280 accessions were indica or japonica. Out of 24 phenotypic variables, a set of seven phenotypic variables was identified to collectively generate the fully accurate indica/japonica separation power of the LRM. The resulting parameters were used to define the customized LRM. Given the 280 indica/japonica accessions, the classification accuracy of the customized LRM along with the set of seven phenotypic variables was estimated by 100 iterations of ten-fold cross-validations. As a result, the classification accuracy of 100% was achieved. This suggests that the LRM can be an effective tool to analyze the indica/japonica classification with phenotypic variables in Oryza sativa.https://peerj.com/articles/7259.pdfLogistic regressionOryza sativaIndicaJaponicaAsian cultivated riceGenetic diversity in rice
collection DOAJ
language English
format Article
sources DOAJ
author Bongsong Kim
spellingShingle Bongsong Kim
Classifying Oryza sativa accessions into Indica and Japonica using logistic regression model with phenotypic data
PeerJ
Logistic regression
Oryza sativa
Indica
Japonica
Asian cultivated rice
Genetic diversity in rice
author_facet Bongsong Kim
author_sort Bongsong Kim
title Classifying Oryza sativa accessions into Indica and Japonica using logistic regression model with phenotypic data
title_short Classifying Oryza sativa accessions into Indica and Japonica using logistic regression model with phenotypic data
title_full Classifying Oryza sativa accessions into Indica and Japonica using logistic regression model with phenotypic data
title_fullStr Classifying Oryza sativa accessions into Indica and Japonica using logistic regression model with phenotypic data
title_full_unstemmed Classifying Oryza sativa accessions into Indica and Japonica using logistic regression model with phenotypic data
title_sort classifying oryza sativa accessions into indica and japonica using logistic regression model with phenotypic data
publisher PeerJ Inc.
series PeerJ
issn 2167-8359
publishDate 2019-11-01
description In Oryza sativa, indica and japonica are pivotal subpopulations, and other subpopulations such as aus and aromatic are considered to be derived from indica or japonica. In this regard, Oryza sativa accessions are frequently viewed from the indica/japonica perspective. This study introduces a computational method for indica/japonica classification by applying phenotypic variables to the logistic regression model (LRM). The population used in this study included 413 Oryza sativa accessions, of which 280 accessions were indica or japonica. Out of 24 phenotypic variables, a set of seven phenotypic variables was identified to collectively generate the fully accurate indica/japonica separation power of the LRM. The resulting parameters were used to define the customized LRM. Given the 280 indica/japonica accessions, the classification accuracy of the customized LRM along with the set of seven phenotypic variables was estimated by 100 iterations of ten-fold cross-validations. As a result, the classification accuracy of 100% was achieved. This suggests that the LRM can be an effective tool to analyze the indica/japonica classification with phenotypic variables in Oryza sativa.
topic Logistic regression
Oryza sativa
Indica
Japonica
Asian cultivated rice
Genetic diversity in rice
url https://peerj.com/articles/7259.pdf
work_keys_str_mv AT bongsongkim classifyingoryzasativaaccessionsintoindicaandjaponicausinglogisticregressionmodelwithphenotypicdata
_version_ 1724874161268981760