----------------------- Need shift 5 command (have more than 9 arguments): ----------------------- ----------------------- Start of variable list: ----------------------- SCRIPT_DIR: /restricted/projectnb/waxmanlab/kkarri/G160_Samples/Scripts/09b_DiffExp_3_featureCounts Dataset_DIR: /restricted/projectnb/waxmanlab/kkarri/G160_Samples Dataset_Label: G160 GTF_Files_DIR: /unprotected/projects/waxmanlab/routines/GTF_Files ANNOTATION_FILE: RefSeq_GeneBody.gtf CONDITION_1_NAME: M_Nuclear CONDITION_2_NAME: M_Cytoplasmic Lengths_DIR: /unprotected/projects/waxmanlab/routines/GTF_Files/lengths GENE_LENGTHS_FILE: Intron_Only_Regions_Lengths.txt COUNT_DIR: RefSeq_Intron_GTF OUTPUT_PREFIX: DiffExp_v2_Intronic_Only DiffExp_Index: DiffExp_3c COL_SUFFIX: Intronic_Only COUNT_PROGRAM: featureCounts ----------------------- End of variable list ----------------------- ========================================================== Starting on : Thu Aug 24 05:10:13 EDT 2017 Running on node : scc-tk3 Current directory : /restricted/projectnb/waxmanlab/kkarri/G160_Samples/Scripts/09b_DiffExp_3_featureCounts Current job ID : 383411 Current job name : Step_09b_DiffExp_3c Task index number : undefined Parameter for multiple cores : 1 ========================================================== Change dir to scratch directory Print scratch directory location: /scratch/383411.1.linga Loading required modules... ------------------------------------------ Sample_DIR: G160_M5 Sample_ID: G160_M5 Description: Male M_Num: M5 Copy Condition_1 sample count files to Condition_1 folder calculate mapped reads: G160_M5 M_Num_Cond1_List: M5 ------------------------------------------ ------------------------------------------ Sample_DIR: G160_M6 Sample_ID: G160_M6 Description: Male M_Num: M6 Copy Condition_2 sample count files to Condition_2 folder calculate mapped reads: G160_M6 M_Num_Cond2_List: M6 ------------------------------------------ ========================================================== Number of replicates in each condition: NUM_REP_CONDITION1: 1 NUM_REP_CONDITION1: 1 WARNING: Either your Condition_1 or Condition_2 has only 1 sample (no replicates) The differential expression analysis is designed for cases where there are at least 2 replicates/biological condition. The resulting differential expression analysis will be converted to a stringent version due to absence of replicate samples. Samples from this comparison will be treated as "replicates" to estimate the gene level variance in edgeR and, separately in, DESeq2. Expect very few to zero DE genes from DESeq2, EdgeR should yield a DE gene count. Interpreting the results: use the edgeR fold change or DESeq ratio (not DESeq fold change) Note regarding the SEGEX upload file(s): Since DESeq2 fold changes are not the same as the ratio of the normalized counts between the conditions, so the output of the segex upload file will be different between "DESeq ratio" and "DESeq foldChange". The former is truly the ratio between the normalized counts between the two conditions (plus pseudocounts) whereas the latter is the fold change estimated by DESeq. ========================================================== Renaming input count files Counting program: Not HTSeq No lines removed. Counting program: Not HTSeq No lines removed. number of mapped reads 30291227 40878015 ========================================================== List files in Input -rw-r--r-- 1 kkarri pulmseq 300238 Aug 24 05:10 /scratch/383411.1.linga/Input/Intron_Only_Regions_Lengths.txt -rw-r--r-- 1 kkarri pulmseq 229285 Aug 24 05:10 /scratch/383411.1.linga/Input/M_Cytoplasmic0.out -rw-r--r-- 1 kkarri pulmseq 9 Aug 24 05:10 /scratch/383411.1.linga/Input/M_Cytoplasmic0_num_mapped_reads.txt -rw-r--r-- 1 kkarri pulmseq 246580 Aug 24 05:10 /scratch/383411.1.linga/Input/M_Nuclear0.out -rw-r--r-- 1 kkarri pulmseq 9 Aug 24 05:10 /scratch/383411.1.linga/Input/M_Nuclear0_num_mapped_reads.txt /scratch/383411.1.linga/Input/M_Cytoplasmic: total 8 drwxr-xr-x 2 kkarri pulmseq 4096 Aug 24 05:10 . drwxr-xr-x 4 kkarri pulmseq 4096 Aug 24 05:10 .. /scratch/383411.1.linga/Input/M_Nuclear: total 8 drwxr-xr-x 2 kkarri pulmseq 4096 Aug 24 05:10 . drwxr-xr-x 4 kkarri pulmseq 4096 Aug 24 05:10 .. ========================================================== Starting to run my commands Printing Rscript command: Rscript differentialAnalysis.R M_Nuclear M_Cytoplasmic 1 1 RefSeq_GeneBody.gtf /scratch/383411.1.linga/Input DiffExp_v2_Intronic_Only Intron_Only_Regions_Lengths.txt [1] "Arguments for differentialAnalysisDESeq.R:" [1] "M_Nuclear" [1] "M_Cytoplasmic" [1] 1 [1] 1 [1] "RefSeq_GeneBody.gtf" [1] "/scratch/383411.1.linga/Input" [1] "DiffExp_v2_Intronic_Only" [1] "Intron_Only_Regions_Lengths.txt" load GTF file ... parse attributes ... [1] "output file is in: /scratch/383411.1.linga/Input/DiffExp_v2_Intronic_Only_M_Nuclear_M_Cytoplasmic.txt" ========================================================== Create SEGEX formatted file: Printing Rscript command: Rscript formatForSegex_ver3.R DiffExp_v2_Intronic_Only_M_Nuclear_M_Cytoplasmic.txt M_Cytoplasmic_G160_M6_vs_M_Nuclear_G160_M5_DiffExp_v2_Intronic_Only_forSEGEXUpload 1 Intronic_Only ========================================================== Comparison_Info: M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5 #---------------------------------------------------------------------------------- Running Diff_Genes.R #---------------------------------------------------------------------------------- Printing Rscript command: Rscript Diff_Genes.R M_Cytoplasmic_G160_M6_vs_M_Nuclear_G160_M5_DiffExp_v2_Intronic_Only_forSEGEXUpload_DESeq.txt 2 0.05 DESeq Intronic_Only'_'M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5 [1] "Print arguments:" [1] "-----------------" [1] "Differential expression_Output:" [1] "M_Cytoplasmic_G160_M6_vs_M_Nuclear_G160_M5_DiffExp_v2_Intronic_Only_forSEGEXUpload_DESeq.txt" [1] "fold_change_cutoff:" [1] "2" [1] "padj_cutoff:" [1] "0.05" [1] "postfix:" [1] "DESeq" [1] "count_method:" [1] "Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5" [1] "-----------------" [1] "Differential Gene Counts" [1] "Number of significant differential genes with positive fold change (Up Genes):" [1] 0 7 [1] "Number of significant differential genes with negative fold change (Down Genes):" [1] 0 7 [1] "Check out Up_Genes_DESeq_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt!" [1] "Check out Down_Genes_DESeq_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt!" Rscript Diff_Genes.R M_Cytoplasmic_G160_M6_vs_M_Nuclear_G160_M5_DiffExp_v2_Intronic_Only_forSEGEXUpload_EdgeR.txt 2 0.05 EdgeR Intronic_Only'_'M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5 [1] "Print arguments:" [1] "-----------------" [1] "Differential expression_Output:" [1] "M_Cytoplasmic_G160_M6_vs_M_Nuclear_G160_M5_DiffExp_v2_Intronic_Only_forSEGEXUpload_EdgeR.txt" [1] "fold_change_cutoff:" [1] "2" [1] "padj_cutoff:" [1] "0.05" [1] "postfix:" [1] "EdgeR" [1] "count_method:" [1] "Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5" [1] "-----------------" [1] "Differential Gene Counts" [1] "Number of significant differential genes with positive fold change (Up Genes):" [1] 42 7 [1] "Number of significant differential genes with negative fold change (Down Genes):" [1] 1 7 [1] "Check out Up_Genes_EdgeR_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt!" [1] "Check out Down_Genes_EdgeR_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt!" #---------------------------------------------------------------------------------- Running Venn_Diff_Genes.R #---------------------------------------------------------------------------------- Printing Rscript command: #---------------------------------------------------------------------------------- Rscript Venn_Diff_Genes.R Down_Genes_DESeq_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt Down_Genes_EdgeR_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt Intronic_Only_Counting DiffExp_3c [1] "Print arguments:" [1] "-----------------" [1] "File1:" [1] "Down_Genes_DESeq_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt" [1] "File2:" [1] "Down_Genes_EdgeR_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt" [1] "Subtitle:" [1] "Intronic_Only_Counting" [1] "DiffExp_Index:" [1] "DiffExp_3c" [1] "-----------------" [1] "Down.DESeq.Intronic_Only.HTSeq" [1] "Down.EdgeR.Intronic_Only.HTSeq" null device 1 [1] "Removing VennDiagram*.log files" [1] "Check out Venn diagram and Count.Table!" #---------------------------------------------------------------------------------- Rscript Venn_Diff_Genes.R Up_Genes_DESeq_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt Up_Genes_EdgeR_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt Intronic_Only_Counting DiffExp_3c [1] "Print arguments:" [1] "-----------------" [1] "File1:" [1] "Up_Genes_DESeq_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt" [1] "File2:" [1] "Up_Genes_EdgeR_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt" [1] "Subtitle:" [1] "Intronic_Only_Counting" [1] "DiffExp_Index:" [1] "DiffExp_3c" [1] "-----------------" [1] "Up.DESeq.Intronic_Only.HTSeq" [1] "Up.EdgeR.Intronic_Only.HTSeq" null device 1 [1] "Removing VennDiagram*.log files" [1] "Check out Venn diagram and Count.Table!" #---------------------------------------------------------------------------------- #---------------------------------------------------------------------------------- Merging Count.Table(s) into one text file #---------------------------------------------------------------------------------- Comparison_Info: M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5 #---------------------------------------------------------------------------------- ========================================================== Renaming the Differential_Expression_File ========================================================== List files in Input -rw-r--r-- 1 kkarri pulmseq 316 Aug 24 05:11 /scratch/383411.1.linga/Input/DiffExp_3c_Venn_Tables_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt -rw-r--r-- 1 kkarri pulmseq 4831175 Aug 24 05:11 /scratch/383411.1.linga/Input/DiffExp_v2_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt -rwxr-xr-x 1 kkarri pulmseq 7394 Aug 24 05:10 /scratch/383411.1.linga/Input/Diff_Genes.R -rw-r--r-- 1 kkarri pulmseq 182 Aug 24 05:11 /scratch/383411.1.linga/Input/Down_Genes_DESeq_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt -rw-r--r-- 1 kkarri pulmseq 235 Aug 24 05:11 /scratch/383411.1.linga/Input/Down_Genes_EdgeR_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt -rw-r--r-- 1 kkarri pulmseq 300238 Aug 24 05:10 /scratch/383411.1.linga/Input/Intron_Only_Regions_Lengths.txt -rw-r--r-- 1 kkarri pulmseq 229285 Aug 24 05:10 /scratch/383411.1.linga/Input/M_Cytoplasmic0.out -rw-r--r-- 1 kkarri pulmseq 9 Aug 24 05:10 /scratch/383411.1.linga/Input/M_Cytoplasmic0_num_mapped_reads.txt -rw-r--r-- 1 kkarri pulmseq 2160625 Aug 24 05:11 /scratch/383411.1.linga/Input/M_Cytoplasmic_G160_M6_vs_M_Nuclear_G160_M5_DiffExp_v2_Intronic_Only_forSEGEXUpload_DESeq.txt -rw-r--r-- 1 kkarri pulmseq 3249490 Aug 24 05:11 /scratch/383411.1.linga/Input/M_Cytoplasmic_G160_M6_vs_M_Nuclear_G160_M5_DiffExp_v2_Intronic_Only_forSEGEXUpload_EdgeR.txt -rw-r--r-- 1 kkarri pulmseq 246580 Aug 24 05:10 /scratch/383411.1.linga/Input/M_Nuclear0.out -rw-r--r-- 1 kkarri pulmseq 9 Aug 24 05:10 /scratch/383411.1.linga/Input/M_Nuclear0_num_mapped_reads.txt -rw-r--r-- 1 kkarri pulmseq 182 Aug 24 05:11 /scratch/383411.1.linga/Input/Up_Genes_DESeq_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt -rw-r--r-- 1 kkarri pulmseq 4630 Aug 24 05:11 /scratch/383411.1.linga/Input/Up_Genes_EdgeR_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5.txt -rwxr-xr-x 1 kkarri pulmseq 9553 Aug 24 05:10 /scratch/383411.1.linga/Input/Venn_Diff_Genes.R -rw-r--r-- 1 kkarri pulmseq 30877 Aug 24 05:11 /scratch/383411.1.linga/Input/Venn_Down.DESeq.Intronic_Only.HTSeq.Down.EdgeR.Intronic_Only.HTSeq.png -rw-r--r-- 1 kkarri pulmseq 30320 Aug 24 05:11 /scratch/383411.1.linga/Input/Venn_Up.DESeq.Intronic_Only.HTSeq.Up.EdgeR.Intronic_Only.HTSeq.png -rwxr-xr-x 1 kkarri pulmseq 3925 Aug 24 05:10 /scratch/383411.1.linga/Input/formatForSegex_ver3.R /scratch/383411.1.linga/Input/M_Cytoplasmic: total 8 drwxr-xr-x 2 kkarri pulmseq 4096 Aug 24 05:10 . drwxr-xr-x 4 kkarri pulmseq 4096 Aug 24 05:11 .. /scratch/383411.1.linga/Input/M_Nuclear: total 8 drwxr-xr-x 2 kkarri pulmseq 4096 Aug 24 05:10 . drwxr-xr-x 4 kkarri pulmseq 4096 Aug 24 05:11 .. ========================================================== List files in scratch total 72M drwxr-xr-x 3 kkarri pulmseq 4.0K Aug 24 05:11 . drwxrwxrwt. 36 root root 20K Aug 24 05:10 .. -rw-r--r-- 1 kkarri pulmseq 54 Aug 24 05:10 Condition_1.txt -rw-r--r-- 1 kkarri pulmseq 54 Aug 24 05:10 Condition_2.txt drwxr-xr-x 4 kkarri pulmseq 4.0K Aug 24 05:11 Input -rw-r--r-- 1 kkarri pulmseq 72M Aug 24 05:10 RefSeq_GeneBody.gtf -rwxr-xr-x 1 kkarri pulmseq 16K Aug 24 05:10 differentialAnalysis.R ========================================================== Re-naming files in OUTPUT_DIR Need to append the COUNT_PROGRAM name to all output files List files in OUTPUT_DIR total 7.1M drwxr-sr-x 2 kkarri waxmanlab 32K Aug 24 05:11 . drwxr-sr-x 6 kkarri waxmanlab 32K Aug 24 05:10 .. -rw-r--r-- 1 kkarri waxmanlab 316 Aug 24 05:11 DiffExp_3c_Venn_Tables_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5_featureCounts.txt -rw-r--r-- 1 kkarri waxmanlab 4.7M Aug 24 05:11 DiffExp_v2_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5_featureCounts.txt -rw-r--r-- 1 kkarri waxmanlab 182 Aug 24 05:11 Down_Genes_DESeq_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5_featureCounts.txt -rw-r--r-- 1 kkarri waxmanlab 235 Aug 24 05:11 Down_Genes_EdgeR_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5_featureCounts.txt -rw-r--r-- 1 kkarri waxmanlab 2.1M Aug 24 05:11 M_Cytoplasmic_G160_M6_vs_M_Nuclear_G160_M5_DiffExp_v2_Intronic_Only_forSEGEXUpload_DESeq_featureCounts.txt -rw-r--r-- 1 kkarri waxmanlab 3.1M Aug 24 05:11 M_Cytoplasmic_G160_M6_vs_M_Nuclear_G160_M5_DiffExp_v2_Intronic_Only_forSEGEXUpload_EdgeR_featureCounts.txt -rw-r--r-- 1 kkarri waxmanlab 182 Aug 24 05:11 Up_Genes_DESeq_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5_featureCounts.txt -rw-r--r-- 1 kkarri waxmanlab 4.6K Aug 24 05:11 Up_Genes_EdgeR_Intronic_Only_M_Cytoplasmic_G160_M6_M_Nuclear_G160_M5_featureCounts.txt -rw-r--r-- 1 kkarri waxmanlab 31K Aug 24 05:11 Venn_Down.DESeq.Intronic_Only.HTSeq.Down.EdgeR.Intronic_Only.HTSeq_featureCounts.png -rw-r--r-- 1 kkarri waxmanlab 30K Aug 24 05:11 Venn_Up.DESeq.Intronic_Only.HTSeq.Up.EdgeR.Intronic_Only.HTSeq_featureCounts.png ========================================================== ========================================================== Finished on : Thu Aug 24 05:11:35 EDT 2017 1 minutes and 22 seconds elapsed. ==========================================================