Version Info: Could not resolve upgrade information in the alotted time.
Check for upgrades manually at https://combine-lab.github.io/salmon
[2019-11-08 12:09:14.649] [jLog] [info] building index
out : /project/shefflab/genomes/hg19_cdna/salmon_index/default
[2019-11-08 12:09:14.651] [puff::index::jointLog] [info] Running fixFasta
[Step 1 of 4] : counting k-mers
[2019-11-08 12:09:14.709] [puff::index::jointLog] [warning] Entry with header [ENST00000415118], had length less than equal to the k-mer length of 31 (perhaps after poly-A clipping)
[2019-11-08 12:09:23.497] [puff::index::jointLog] [warning] Entry with header [ENST00000579054], had length less than equal to the k-mer length of 31 (perhaps after poly-A clipping)
[2019-11-08 12:09:23.526] [puff::index::jointLog] [warning] Entry with header [ENST00000603775], had length less than equal to the k-mer length of 31 (perhaps after poly-A clipping)
[2019-11-08 12:09:23.575] [puff::index::jointLog] [warning] Removed 12985 transcripts that were sequence duplicates of indexed transcripts.
[2019-11-08 12:09:23.575] [puff::index::jointLog] [warning] If you wish to retain duplicate transcripts, please use the `--keepDuplicates` flag
[2019-11-08 12:09:23.587] [puff::index::jointLog] [info] Replaced 0 non-ATCG nucleotides
[2019-11-08 12:09:23.587] [puff::index::jointLog] [info] Clipped poly-A tails from 1401 transcripts
wrote 167220 cleaned references
seqHash 256 : f88b2fba4f6c374e206112eafcccff7300d4cc6cbf59f68b513307c15bf52074
seqHash 512 : 967d1294996dd78c2ed981a62b2c93ab6dc9bebfedb2051085275c882b2d690f3a8bf43189da9a06069b0b866b958f01a3eb52aa579194f3ca7b2026bf8f51ef
nameHash 256 : e52e717c3a77183e10af9ca7b3e665373bcd27c5e51d2ca6241d829c7999d79c
nameHash 512 : d0835b71c9a18d3e60d2940fab623fc25f2c99eb0a684182a0e1923bba95b9d135b9bba611bcc6c9dc62673d94b8c69e2121909555e12eafd569cf7c26ec91ab
[2019-11-08 12:09:25.909] [puff::index::jointLog] [info] Filter size not provided; estimating from number of distinct k-mers
[2019-11-08 12:09:28.351] [puff::index::jointLog] [info] ntHll estimated 100826112 distinct k-mers, setting filter size to 2^31
Threads = 2
Vertex length = 31
Hash functions = 5
Filter size = 2147483648
Capacity = 2
Files:
/project/shefflab/genomes/hg19_cdna/salmon_index/default/ref_k31_fixed.fa
--------------------------------------------------------------------------------
Round 0, 0:2147483648
Pass    Filling    Filtering
1       113        1243
2       80         0
True junctions count = 665162
False junctions count = 998913
Hash table size = 1664075
Candidate marks count = 7805037
--------------------------------------------------------------------------------
Reallocating bifurcations time: 1
True marks count: 5143432
Edges construction time: 474
--------------------------------------------------------------------------------
Distinct junctions = 665162
approximateContigTotalLength: 72596465
counters: 38809 700 698 75
contig count: 994041 element count: 131267297 complex nodes: 40282
size: 131267297
# of ones in rank vector: 994040
size: 131267297
[2019-11-08 12:41:39.597] [puff::index::jointLog] [info] Setting the index/BinaryGfa directory /project/shefflab/genomes/hg19_cdna/salmon_index/default
size = 131267297
-----------------------------------------
| Loading contigs | Time = 13.69 ms
-----------------------------------------
size = 131267297
-----------------------------------------
| Loading contig boundaries | Time = 7.111 ms
-----------------------------------------
Number of ones: 994040
Number of ones per inventory item: 512
Inventory entries filled: 1942
[2019-11-08 12:41:39.843] [puff::index::jointLog] [info] Done wrapping the rank vector with a rank9sel structure.
[2019-11-08 12:41:40.037] [puff::index::jointLog] [info] contig count for validation: 994040
[2019-11-08 12:41:40.590] [puff::index::jointLog] [info] Total # of Contigs : 994040
[2019-11-08 12:41:40.590] [puff::index::jointLog] [info] Total # of numerical Contigs : 994040
[2019-11-08 12:41:41.879] [puff::index::jointLog] [info] Total # of segments we have position for : 994040
[2019-11-08 12:41:41.947] [puff::index::jointLog] [info] total contig vec entries 5131470
[2019-11-08 12:41:41.947] [puff::index::jointLog] [info] bits per offset entry 23
[2019-11-08 12:41:42.707] [puff::index::jointLog] [info] there were 640430 equivalence classes
[2019-11-08 12:41:45.521] [puff::index::jointLog] [info] # segments = 994040
[2019-11-08 12:41:45.521] [puff::index::jointLog] [info] total length = 131267297
[2019-11-08 12:41:45.571] [puff::index::jointLog] [info] Reading the reference files ...
BooPHF] 99.5 % elapsed: 0 min 20 sec remaining: 0 min 0 sec
[Building BooPHF] 99.7 % elapsed: 0 min 20 sec remaining: 0 min 0 sec
[Building BooPHF] 99.7 % elapsed: 0 min 20 sec remaining: 0 min 0 sec
[Building BooPHF] 99.9 % elapsed: 0 min 20 sec remaining: 0 min 0 sec
[Building BooPHF] 99.9 % elapsed: 0 min 20 sec remaining: 0 min 0 sec
[2019-11-08 12:42:07.528] [puff::index::jointLog] [info] mphf size = 63.3658 MB
[2019-11-08 12:42:07.528] [puff::index::jointLog] [info] chunk size = 65633649
[2019-11-08 12:42:07.530] [puff::index::jointLog] [info] chunk 0 = [0, 65633649)
[2019-11-08 12:42:07.530] [puff::index::jointLog] [info] chunk 1 = [65633649, 131267267)
[2019-11-08 12:42:38.541] [puff::index::jointLog] [info] finished populating pos vector
[2019-11-08 12:42:38.541] [puff::index::jointLog] [info] writing index components
[2019-11-08 12:42:40.101] [puff::index::jointLog] [info] finished writing dense pufferfish index
[2019-11-08 12:42:40.181] [jLog] [info] done building index
for info, total work write each : 2.331 total work inram from level 3 : 4.322 total work raw : 25.000
Bitarray 531550656 bits (100.00 %) (array + ranks )
final hash 0 bits (0.00 %) (nb in final hash 0)
Command completed. Elapsed time: 0:33:26. Running peak memory: 0.483GB.
PID: 92577; Command: salmon; Return code: 0; Memory used: 0.483GB

> `touch /project/shefflab/genomes/hg19_cdna/salmon_index/default/_refgenie_build/hg19_cdna_salmon_index__default.flag` (100545)
Command completed. Elapsed time: 0:00:00. Running peak memory: 0.483GB.
PID: 100545; Command: touch; Return code: 0; Memory used: 0.0GB

> `cd /project/shefflab/genomes/hg19_cdna/salmon_index/default; find . -type f -not -path './_refgenie_build*' -exec md5sum {} \; | sort -k 2 | awk '{print $1}' | md5sum`
Asset digest: aa95f48e339f1167a99af2fcdbed4a22
Default tag for 'hg19_cdna/salmon_index' set to: default

### Pipeline completed. Epilogue
* Elapsed time (this run): 0:33:28
* Total elapsed time (all runs): 0:33:26
* Peak memory (this run): 0.483 GB
* Pipeline completed time: 2019-11-08 12:42:42

Finished building asset 'salmon_index'