G3Bio

    ANALYTICS

    DESeq2

    DESeq2

    Differential expression analysis for (bulk) RNA-Seq data

    Input data

    Files:

    • Salmon or Kallisto output files (with one sample associated with a subfolder)
    • A design.csv file describing each sample (i.e. sample metadata), for example,
    sampleid,group,subjectid
    SRR2123765,neg,1
    SRR2123766,pos,1
    SRR2123767,neg,2
    SRR2123768,pos,2
    SRR2123769,pos,3
    SRR2123770,neg,3
    SRR2123771,neg,4
    SRR2123772,pos,4
    ...
    

    , where the sample identifiers must be in the first column.

    Files are organized as follows:

    my_project
    |-- SRR2123771
    |   |-- aux_info
    |   |   |-- ambig_info.tsv
    |   |   |-- exp3_seq.gz
    |   |   |-- exp5_seq.gz
    |   |   |-- expected_bias.gz
    |   |   |-- exp_gc.gz
    |   |   |-- fld.gz
    |   |   |-- meta_info.json
    |   |   |-- obs3_seq.gz
    |   |   |-- obs5_seq.gz
    |   |   |-- observed_bias_3p.gz
    |   |   |-- observed_bias.gz
    |   |   `-- obs_gc.gz
    |   |-- libParams
    |   |   `-- flenDist.txt
    |   |-- logs
    |   |   `-- salmon_quant.log
    |   |-- cmd_info.json
    |   |-- lib_format_counts.json
    |   `-- quant.sf
    |-- SRR2123772
    |   |-- aux_info
    |   |   |-- ambig_info.tsv
    |   |   |-- exp3_seq.gz
    |   |   |-- exp5_seq.gz
    |   |   |-- expected_bias.gz
    |   |   |-- exp_gc.gz
    |   |   |-- fld.gz
    |   |   |-- meta_info.json
    |   |   |-- obs3_seq.gz
    |   |   |-- obs5_seq.gz
    |   |   |-- observed_bias_3p.gz
    |   |   |-- observed_bias.gz
    |   |   `-- obs_gc.gz
    |   |-- libParams
    |   |   `-- flenDist.txt
    |   |-- logs
    |   |   `-- salmon_quant.log
    |   |-- cmd_info.json
    |   |-- lib_format_counts.json
    |   `-- quant.sf
    |-- design.csv
    

    Output files

    • expression_matrix.tsv : Normalized gene expression matrix
    • *_comparion.tsv : comparison results (one for each contrast) from running LIMMA package

    For example,

    output/
    |-- expression_matrix.tsv
    |-- group-pos-vs-neg_comparison.tsv
    

    Parameter settings

    • factor_of_interest — Comparison column; optional
    • reference_level — Baseline level; must be one of the values in factor_of_interest column
    • grouping_factor — Subgroup analysis column; optional
    • blocking_factor — Blocking/Pairing column, used for controlling blocking/pairing effects; optional
    • minimum_total_counts — Threshold used for filtering out low-expression genes

    References

    Please cite:

    Bioconductor DESeq2 package

    • DataHub
    • Projects
    • analytics
      • Affymetrix
      • DESeq2
      • FGSEA
      • gscreend
      • Seurat
      • STACAS
    • collections-view-bulk-sample
      • CNV > CNV Distribution
      • CNV > Gene List
      • Expression > Expression Grouping
      • Expression > Expression Heatmap
      • Expression > Expression Volcano
      • Expression > Expression Waterfall
      • Methylation > Methylation Beta Values
      • Mutation > Alteration Distribution
      • Mutation > Mutated Genes
      • Mutation > Mutation Landscape
      • Sample > Data Availability
      • Sample > Sample Details
      • Sample > Sample Grouping
      • Special > Waterfall
      • Special > Correlation
      • Special > Grid Pattern
      • Special > Survival
    • collections-view-single-cell
      • Cell > Cell Details
      • Cell > Cell Grouping
      • Cell > Cell Grouping (Proportion)
      • Correlation
      • Dot Plot
      • Feature
      • Markers
      • Violin Plot
    • commands
      • g3tools
      • ConvertMetaData
      • ConvertData
      • LoadData
      • LoadSingleCell
      • RunAnalysis
    • getting-started
      • Quick Tour
    • installation
      • Server Installation (AMI-Based)
      • Server Installation (Docker-Based)
    G3Bio Logo
     

    Copyright © G3Bio 2025

      • DataHub
      • Projects
      • analytics
      • collections-view-bulk-sample
      • collections-view-single-cell
      • commands
      • getting-started
      • installation