----------------------- Need shift 5 command (have more than 9 arguments): ----------------------- ----------------------- Start of variable list: ----------------------- SCRIPT_DIR: /restricted/projectnb/waxmanlab/kkarri/G160_Samples/Scripts/09a_DiffExp_5_HTSeq Dataset_DIR: /restricted/projectnb/waxmanlab/kkarri/G160_Samples Dataset_Label: G160 GTF_Files_DIR: /unprotected/projects/waxmanlab/routines/GTF_Files ANNOTATION_FILE: RefSeq_GeneBody.gtf CONDITION_1_NAME: F_Cytoplasmic CONDITION_2_NAME: M_Cytoplasmic Lengths_DIR: /unprotected/projects/waxmanlab/routines/GTF_Files/lengths GENE_LENGTHS_FILE: Exon_Regions_Lengths.txt COUNT_DIR: RefSeq_Exon_GTF OUTPUT_PREFIX: DiffExp_v2_GeneBody DiffExp_Index: DiffExp_5a COL_SUFFIX: GeneBody COUNT_PROGRAM: HTSeq ----------------------- End of variable list ----------------------- ========================================================== Starting on : Thu Aug 24 04:42:45 EDT 2017 Running on node : scc-tm1 Current directory : /restricted/projectnb/waxmanlab/kkarri/G160_Samples/Scripts/09a_DiffExp_5_HTSeq Current job ID : 383345 Current job name : Step_09a_DiffExp_5a Task index number : undefined Parameter for multiple cores : 1 ========================================================== Change dir to scratch directory Print scratch directory location: /scratch/383345.1.linga Loading required modules... ------------------------------------------ Sample_DIR: G160_M4 Sample_ID: G160_M4 Description: Female M_Num: M4 Copy Condition_1 sample count files to Condition_1 folder calculate mapped reads: G160_M4 M_Num_Cond1_List: M4 ------------------------------------------ ------------------------------------------ Sample_DIR: G160_M6 Sample_ID: G160_M6 Description: Male M_Num: M6 Copy Condition_2 sample count files to Condition_2 folder calculate mapped reads: G160_M6 M_Num_Cond2_List: M6 ------------------------------------------ ========================================================== Number of replicates in each condition: NUM_REP_CONDITION1: 1 NUM_REP_CONDITION1: 1 WARNING: Either your Condition_1 or Condition_2 has only 1 sample (no replicates) The differential expression analysis is designed for cases where there are at least 2 replicates/biological condition. The resulting differential expression analysis will be converted to a stringent version due to absence of replicate samples. Samples from this comparison will be treated as "replicates" to estimate the gene level variance in edgeR and, separately in, DESeq2. Expect very few to zero DE genes from DESeq2, EdgeR should yield a DE gene count. Interpreting the results: use the edgeR fold change or DESeq ratio (not DESeq fold change) Note regarding the SEGEX upload file(s): Since DESeq2 fold changes are not the same as the ratio of the normalized counts between the conditions, so the output of the segex upload file will be different between "DESeq ratio" and "DESeq foldChange". The former is truly the ratio between the normalized counts between the two conditions (plus pseudocounts) whereas the latter is the fold change estimated by DESeq. ========================================================== Renaming input count files Counting program: HTSeq Removing last 5 lines (remove special counters) Counting program: HTSeq Removing last 5 lines (remove special counters) number of mapped reads 24143242 30291227 ========================================================== List files in Input -rw-r--r-- 1 kkarri pulmseq 291522 Aug 24 04:42 /scratch/383345.1.linga/Input/Exon_Regions_Lengths.txt -rw-r--r-- 1 kkarri pulmseq 234178 Aug 24 04:42 /scratch/383345.1.linga/Input/F_Cytoplasmic0.out -rw-r--r-- 1 kkarri pulmseq 9 Aug 24 04:42 /scratch/383345.1.linga/Input/F_Cytoplasmic0_num_mapped_reads.txt -rw-r--r-- 1 kkarri pulmseq 234807 Aug 24 04:42 /scratch/383345.1.linga/Input/M_Cytoplasmic0.out -rw-r--r-- 1 kkarri pulmseq 9 Aug 24 04:42 /scratch/383345.1.linga/Input/M_Cytoplasmic0_num_mapped_reads.txt /scratch/383345.1.linga/Input/F_Cytoplasmic: total 8 drwxr-xr-x 2 kkarri pulmseq 4096 Aug 24 04:42 . drwxr-xr-x 4 kkarri pulmseq 4096 Aug 24 04:42 .. /scratch/383345.1.linga/Input/M_Cytoplasmic: total 8 drwxr-xr-x 2 kkarri pulmseq 4096 Aug 24 04:42 . drwxr-xr-x 4 kkarri pulmseq 4096 Aug 24 04:42 .. ========================================================== Starting to run my commands Printing Rscript command: Rscript differentialAnalysis.R F_Cytoplasmic M_Cytoplasmic 1 1 RefSeq_GeneBody.gtf /scratch/383345.1.linga/Input DiffExp_v2_GeneBody Exon_Regions_Lengths.txt [1] "Arguments for differentialAnalysisDESeq.R:" [1] "F_Cytoplasmic" [1] "M_Cytoplasmic" [1] 1 [1] 1 [1] "RefSeq_GeneBody.gtf" [1] "/scratch/383345.1.linga/Input" [1] "DiffExp_v2_GeneBody" [1] "Exon_Regions_Lengths.txt" load GTF file ... parse attributes ... [1] "output file is in: /scratch/383345.1.linga/Input/DiffExp_v2_GeneBody_F_Cytoplasmic_M_Cytoplasmic.txt" ========================================================== Create SEGEX formatted file: Printing Rscript command: Rscript formatForSegex_ver3.R DiffExp_v2_GeneBody_F_Cytoplasmic_M_Cytoplasmic.txt M_Cytoplasmic_G160_M6_vs_F_Cytoplasmic_G160_M4_DiffExp_v2_GeneBody_forSEGEXUpload 1 GeneBody ========================================================== Comparison_Info: M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4 #---------------------------------------------------------------------------------- Running Diff_Genes.R #---------------------------------------------------------------------------------- Printing Rscript command: Rscript Diff_Genes.R M_Cytoplasmic_G160_M6_vs_F_Cytoplasmic_G160_M4_DiffExp_v2_GeneBody_forSEGEXUpload_DESeq.txt 2 0.05 DESeq GeneBody'_'M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4 [1] "Print arguments:" [1] "-----------------" [1] "Differential expression_Output:" [1] "M_Cytoplasmic_G160_M6_vs_F_Cytoplasmic_G160_M4_DiffExp_v2_GeneBody_forSEGEXUpload_DESeq.txt" [1] "fold_change_cutoff:" [1] "2" [1] "padj_cutoff:" [1] "0.05" [1] "postfix:" [1] "DESeq" [1] "count_method:" [1] "GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4" [1] "-----------------" [1] "Differential Gene Counts" [1] "Number of significant differential genes with positive fold change (Up Genes):" [1] 0 7 [1] "Number of significant differential genes with negative fold change (Down Genes):" [1] 0 7 [1] "Check out Up_Genes_DESeq_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt!" [1] "Check out Down_Genes_DESeq_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt!" Rscript Diff_Genes.R M_Cytoplasmic_G160_M6_vs_F_Cytoplasmic_G160_M4_DiffExp_v2_GeneBody_forSEGEXUpload_EdgeR.txt 2 0.05 EdgeR GeneBody'_'M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4 [1] "Print arguments:" [1] "-----------------" [1] "Differential expression_Output:" [1] "M_Cytoplasmic_G160_M6_vs_F_Cytoplasmic_G160_M4_DiffExp_v2_GeneBody_forSEGEXUpload_EdgeR.txt" [1] "fold_change_cutoff:" [1] "2" [1] "padj_cutoff:" [1] "0.05" [1] "postfix:" [1] "EdgeR" [1] "count_method:" [1] "GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4" [1] "-----------------" [1] "Differential Gene Counts" [1] "Number of significant differential genes with positive fold change (Up Genes):" [1] 79 7 [1] "Number of significant differential genes with negative fold change (Down Genes):" [1] 70 7 [1] "Check out Up_Genes_EdgeR_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt!" [1] "Check out Down_Genes_EdgeR_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt!" #---------------------------------------------------------------------------------- Running Venn_Diff_Genes.R #---------------------------------------------------------------------------------- Printing Rscript command: #---------------------------------------------------------------------------------- Rscript Venn_Diff_Genes.R Down_Genes_DESeq_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt Down_Genes_EdgeR_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt GeneBody_Counting DiffExp_5a [1] "Print arguments:" [1] "-----------------" [1] "File1:" [1] "Down_Genes_DESeq_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt" [1] "File2:" [1] "Down_Genes_EdgeR_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt" [1] "Subtitle:" [1] "GeneBody_Counting" [1] "DiffExp_Index:" [1] "DiffExp_5a" [1] "-----------------" [1] "Down.DESeq.GeneBody.HTSeq" [1] "Down.EdgeR.GeneBody.HTSeq" null device 1 [1] "Removing VennDiagram*.log files" [1] "Check out Venn diagram and Count.Table!" #---------------------------------------------------------------------------------- Rscript Venn_Diff_Genes.R Up_Genes_DESeq_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt Up_Genes_EdgeR_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt GeneBody_Counting DiffExp_5a [1] "Print arguments:" [1] "-----------------" [1] "File1:" [1] "Up_Genes_DESeq_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt" [1] "File2:" [1] "Up_Genes_EdgeR_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt" [1] "Subtitle:" [1] "GeneBody_Counting" [1] "DiffExp_Index:" [1] "DiffExp_5a" [1] "-----------------" [1] "Up.DESeq.GeneBody.HTSeq" [1] "Up.EdgeR.GeneBody.HTSeq" null device 1 [1] "Removing VennDiagram*.log files" [1] "Check out Venn diagram and Count.Table!" #---------------------------------------------------------------------------------- #---------------------------------------------------------------------------------- Merging Count.Table(s) into one text file #---------------------------------------------------------------------------------- ========================================================== Renaming the Differential_Expression_File ========================================================== List files in Input -rw-r--r-- 1 kkarri pulmseq 288 Aug 24 04:44 /scratch/383345.1.linga/Input/DiffExp_5a_Venn_Tables_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt -rw-r--r-- 1 kkarri pulmseq 4568267 Aug 24 04:44 /scratch/383345.1.linga/Input/DiffExp_v2_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt -rwxr-xr-x 1 kkarri pulmseq 7394 Aug 24 04:42 /scratch/383345.1.linga/Input/Diff_Genes.R -rw-r--r-- 1 kkarri pulmseq 151 Aug 24 04:44 /scratch/383345.1.linga/Input/Down_Genes_DESeq_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt -rw-r--r-- 1 kkarri pulmseq 8691 Aug 24 04:44 /scratch/383345.1.linga/Input/Down_Genes_EdgeR_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt -rw-r--r-- 1 kkarri pulmseq 291522 Aug 24 04:42 /scratch/383345.1.linga/Input/Exon_Regions_Lengths.txt -rw-r--r-- 1 kkarri pulmseq 234178 Aug 24 04:42 /scratch/383345.1.linga/Input/F_Cytoplasmic0.out -rw-r--r-- 1 kkarri pulmseq 9 Aug 24 04:42 /scratch/383345.1.linga/Input/F_Cytoplasmic0_num_mapped_reads.txt -rw-r--r-- 1 kkarri pulmseq 234807 Aug 24 04:42 /scratch/383345.1.linga/Input/M_Cytoplasmic0.out -rw-r--r-- 1 kkarri pulmseq 9 Aug 24 04:42 /scratch/383345.1.linga/Input/M_Cytoplasmic0_num_mapped_reads.txt -rw-r--r-- 1 kkarri pulmseq 2184791 Aug 24 04:44 /scratch/383345.1.linga/Input/M_Cytoplasmic_G160_M6_vs_F_Cytoplasmic_G160_M4_DiffExp_v2_GeneBody_forSEGEXUpload_DESeq.txt -rw-r--r-- 1 kkarri pulmseq 3975369 Aug 24 04:44 /scratch/383345.1.linga/Input/M_Cytoplasmic_G160_M6_vs_F_Cytoplasmic_G160_M4_DiffExp_v2_GeneBody_forSEGEXUpload_EdgeR.txt -rw-r--r-- 1 kkarri pulmseq 151 Aug 24 04:44 /scratch/383345.1.linga/Input/Up_Genes_DESeq_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt -rw-r--r-- 1 kkarri pulmseq 10423 Aug 24 04:44 /scratch/383345.1.linga/Input/Up_Genes_EdgeR_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4.txt -rwxr-xr-x 1 kkarri pulmseq 9553 Aug 24 04:42 /scratch/383345.1.linga/Input/Venn_Diff_Genes.R -rw-r--r-- 1 kkarri pulmseq 31289 Aug 24 04:44 /scratch/383345.1.linga/Input/Venn_Down.DESeq.GeneBody.HTSeq.Down.EdgeR.GeneBody.HTSeq.png -rw-r--r-- 1 kkarri pulmseq 30156 Aug 24 04:44 /scratch/383345.1.linga/Input/Venn_Up.DESeq.GeneBody.HTSeq.Up.EdgeR.GeneBody.HTSeq.png -rwxr-xr-x 1 kkarri pulmseq 3925 Aug 24 04:42 /scratch/383345.1.linga/Input/formatForSegex_ver3.R /scratch/383345.1.linga/Input/F_Cytoplasmic: total 8 drwxr-xr-x 2 kkarri pulmseq 4096 Aug 24 04:42 . drwxr-xr-x 4 kkarri pulmseq 4096 Aug 24 04:44 .. /scratch/383345.1.linga/Input/M_Cytoplasmic: total 8 drwxr-xr-x 2 kkarri pulmseq 4096 Aug 24 04:42 . drwxr-xr-x 4 kkarri pulmseq 4096 Aug 24 04:44 .. ========================================================== List files in scratch total 72M drwxr-xr-x 3 kkarri pulmseq 4.0K Aug 24 04:44 . drwxrwxrwt. 29 root root 20K Aug 24 04:42 .. -rw-r--r-- 1 kkarri pulmseq 56 Aug 24 04:42 Condition_1.txt -rw-r--r-- 1 kkarri pulmseq 54 Aug 24 04:42 Condition_2.txt drwxr-xr-x 4 kkarri pulmseq 4.0K Aug 24 04:44 Input -rw-r--r-- 1 kkarri pulmseq 72M Aug 24 04:42 RefSeq_GeneBody.gtf -rwxr-xr-x 1 kkarri pulmseq 16K Aug 24 04:42 differentialAnalysis.R ========================================================== Re-naming files in OUTPUT_DIR Need to append the COUNT_PROGRAM name to all output files List files in OUTPUT_DIR total 8.1M drwxr-sr-x 2 kkarri waxmanlab 32K Aug 24 04:44 . drwxr-sr-x 6 kkarri waxmanlab 32K Aug 24 04:42 .. -rw-r--r-- 1 kkarri waxmanlab 288 Aug 24 04:44 DiffExp_5a_Venn_Tables_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4_HTSeq.txt -rw-r--r-- 1 kkarri waxmanlab 4.4M Aug 24 04:44 DiffExp_v2_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4_HTSeq.txt -rw-r--r-- 1 kkarri waxmanlab 151 Aug 24 04:44 Down_Genes_DESeq_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4_HTSeq.txt -rw-r--r-- 1 kkarri waxmanlab 8.5K Aug 24 04:44 Down_Genes_EdgeR_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4_HTSeq.txt -rw-r--r-- 1 kkarri waxmanlab 2.1M Aug 24 04:44 M_Cytoplasmic_G160_M6_vs_F_Cytoplasmic_G160_M4_DiffExp_v2_GeneBody_forSEGEXUpload_DESeq_HTSeq.txt -rw-r--r-- 1 kkarri waxmanlab 3.8M Aug 24 04:44 M_Cytoplasmic_G160_M6_vs_F_Cytoplasmic_G160_M4_DiffExp_v2_GeneBody_forSEGEXUpload_EdgeR_HTSeq.txt -rw-r--r-- 1 kkarri waxmanlab 151 Aug 24 04:44 Up_Genes_DESeq_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4_HTSeq.txt -rw-r--r-- 1 kkarri waxmanlab 11K Aug 24 04:44 Up_Genes_EdgeR_GeneBody_M_Cytoplasmic_G160_M6_F_Cytoplasmic_G160_M4_HTSeq.txt -rw-r--r-- 1 kkarri waxmanlab 31K Aug 24 04:44 Venn_Down.DESeq.GeneBody.HTSeq.Down.EdgeR.GeneBody.HTSeq_HTSeq.png -rw-r--r-- 1 kkarri waxmanlab 30K Aug 24 04:44 Venn_Up.DESeq.GeneBody.HTSeq.Up.EdgeR.GeneBody.HTSeq_HTSeq.png ========================================================== ========================================================== Finished on : Thu Aug 24 04:44:06 EDT 2017 1 minutes and 21 seconds elapsed. ==========================================================