Subset data frame for miRNA names whose frequency exceeds a threshold.

subset_mir_threshold(df, threshold = 1, col.mir = miRNA, col.pmid = PMID)

Arguments

df

Data frame containing miRNA names and a PubMed-IDs.

threshold

Integer or float. If threshold >= 1, retains miRNA names in at least threshold abstracts. If threshold is between 0 and 1, retains miRNA names mentioned in at least threshold abstracts of all abstracts in df.

col.mir

Symbol. Column containing miRNA names.

col.pmid

Symbol. Column containing PubMed-IDs.

Value

Data frame, subset for miRNA names whose frequency exceeds a threshold.

Details

Subset data frame for miRNA names whose frequency exceeds a threshold. This threshold can either be an absolute value, e.g. 3, or a float between 0 and 1, e.g. 0.2. If threshold is an absolute value, subset_mir_threshold() retains miRNA names mentioned in at least threshold abstracts. If threshold is a float between 0 and 1, subset_mir_threshold() retains miRNA names mentioned in at least threshold abstracts of all abstracts in df.

See also