问题

我正在尝试使用Nextflow执行我的第一个代码，我引入了2个成对的读取并希望执行bbduk函数。我不知道为什么我的代码不起作用。

我尝试了以下代码：

#!/usr/bin/env nextflow

/*
 * Pipeline Metagenomics, conda ambient Metagenomics_Nextflow
 */

nextflow.enable.dsl=2 // Nextflow版本

params.fq1 = "$HOME/../*{1}.fq.gz" // 可以在脚本中使用--fq1 <value>命令进行修改
params.fq2 = "$HOME/..//*{2}.fq.gz" // 可以在脚本中使用--fq2 <value>命令进行修改

/*
 * Check the quality of raw sequences
 */

process BBduk {

    input:
        tuple val(meta), path(reads)
    tag {meta.id}

    output:
        tuple val(meta), path('*.fq.gz'), emit: reads
        tuple val(meta), path('*.log'), emit: log

    script:
        def prefix = file("${meta.id}")
        def raw      = "in1=${reads[0]} in2=${reads[1]}"
        def trimmed  = "out1=${prefix}_1.fastq.gz out2=${prefix}_2.fastq.gz"
        """
        echo(${prefix})
        echo(${raw})
        echo(${trimmed})
        bbduk.sh \\
        $raw \\
        $trimmed \\
        qtrim=r trimq=10 minlen=100 \\
        &> ${prefix}.bbduk.log
        """
}
/*
* Workflow
*/
workflow {
    sequences = Channel.fromFilePairs( [params.fq1,params.fq2] )
    println "Performing Quality control and triming from $sequences" 
    BBduk(sequences)
}

该代码在终端中产生以下错误。

ERROR ~ Error executing process > 'BBduk (1)'

Caused by:
  No such variable: id -- Check script 'Workflow_Clean_Human_DNA.nf' at line: 34

Source block:
  def prefix = file("${meta.id}")
  def raw      = "in1=${reads[0]} in2=${reads[1]}"
  def trimmed  = "out1=${prefix}_1.fastq.gz out2=${prefix}_2.fastq.gz"
  """
  echo(${prefix})
  echo(${raw})
  echo(${trimmed})
  bbduk.sh \\
  $raw \\
  $trimmed \\
  qtrim=r trimq=10 minlen=100 \\
  &> ${prefix}.bbduk.log
  """

代码的思想是对原始读取执行bbduk算法，我尝试了一对工作。我想在Nextflow中执行它，因为想将输出与其他算法链接在一起。

谢谢！

英文:

I'm trying to perform my first code with Next-flow, im introducing 2 paired reads and I want to execute the bbduk function. I don't know why my code didn't works.

I tryed the following code:

  #!/usr/bin/env nextflow


/*
 * Pipeline Metagenomics, conda ambient Metagenomics_Nextflow
 */

 nextflow.enable.dsl=2 // Nextflow version

params.fq1 = &quot;$HOME/../*{1}.fq.gz&quot; // Can be modified in the script with the --fq1 &lt;value&gt; command
params.fq2 = &quot;$HOME/..//*{2}.fq.gz&quot; // Can be modified in the script with the --fq2 &lt;value&gt; command
/*
 * params.out = &#39;OTU.tsv&#39;		// result file
 * params.databaseHuman = &quot;$baseDir/Database/GRCh38&quot; // 
 */

/*
 * Check the quality of raw sequences
 */

 process BBduk {

    input:
        tuple val(meta), path(reads)
    tag {meta.id}

    output:
        tuple val(meta), path(&#39;*.fq.gz&#39;), emit: reads
        tuple val(meta), path(&#39;*.log&#39;), emit: log
    
     script:
        def prefix = file(&quot;${meta.id}&quot;)
        def raw      = &quot;in1=${reads[0]} in2=${reads[1]}&quot;
        def trimmed  = &quot;out1=${prefix}_1.fastq.gz out2=${prefix}_2.fastq.gz&quot;
        &quot;&quot;&quot;
        echo(${prefix})
        echo(${raw})
        echo(${trimmed})
        bbduk.sh \\
        $raw \\
        $trimmed \\
        qtrim=r trimq=10 minlen=100 \\
        &amp;&gt; ${prefix}.bbduk.log
        &quot;&quot;&quot;

 }
/*
* Workflow
*/
workflow {
sequences = Channel.fromFilePairs( [params.fq1,params.fq2] )
println &quot;Performing Quality control and triming from $sequences&quot; 
    BBduk(sequences)
}

The code produce the following error in terminal.

 ERROR ~ Error executing process &gt; &#39;BBduk (1)&#39;

Caused by:
  No such variable: id -- Check script &#39;Workflow_Clean_Human_DNA.nf&#39; at line: 34

Source block:
  def prefix = file(&quot;${meta.id}&quot;)
  def raw      = &quot;in1=${reads[0]} in2=${reads[1]}&quot;
  def trimmed  = &quot;out1=${prefix}_1.fastq.gz out2=${prefix}_2.fastq.gz&quot;
  &quot;&quot;&quot;
          echo(${prefix})
          echo(${raw})
          echo(${trimmed})
          bbduk.sh \\
          $raw \\
          $trimmed \\
          qtrim=r trimq=10 minlen=100 \\
          &amp;&gt; ${prefix}.bbduk.log
          &quot;&quot;&quot;

Tip: you can try to figure out what&#39;s wrong by changing to the process work dir and showing the script file named `.command.sh`

 -- Check &#39;.nextflow.log&#39; file for details

The idea of the code is to perform a bbduk algorithm into raw reads, I tried to perform a duple to work. I wan't to do it in next flow because the idea is chain the output with other algorithm

Thanks!

答案1

得分: 0

fromFilePairs工厂方法发出元组，其中第一个元素是分组键（第二个元素是匹配文件的列表）。它是一个简单的字符串（java.lang.String），而不是元数据映射。我认为你所需的只是类似以下的东西：


process BBduk {

    tag { sample_id }

    input:
    tuple val(sample_id), path(reads, stageAs: 'reads/*')

    output:
    tuple val(sample_id), path("${sample_id}_{1,2}.fq.gz"), emit: reads
    tuple val(sample_id), path("${sample_id}.bbduk.log"), emit: log

    script:
    def (fq1, fq2) = reads

    """
    bbduk.sh \\
        in1="${fq1}" \\
        in2="${fq2}" \\
        out1="${sample_id}_1.fq.gz" \\
        out2="${sample_id}_2.fq.gz" \\
        qtrim=r \\
        trimq=10 \\
        minlen=100 \\
        &>> "${sample_id}.bbduk.log"
    """
}


    reads = Channel.fromFilePairs( params.reads )

    BBduk( reads )
}

请注意，通过将文件放置在子目录中（使用stageAs: 'reads/*'），我们确保不会意外覆盖工作目录中的输入文件。

英文:

The fromFilePairs factory method emits tuples in which the first element is the grouping key (and the second element is the list of matching files). It is a simple String (java.lang.String), not a Map of meta data. I think all you need is something like:

params.reads = &#39;./*_{1,2}.fq.gz&#39;


process BBduk {

    tag { sample_id }

    input:
    tuple val(sample_id), path(reads, stageAs: &#39;reads/*&#39;)

    output:
    tuple val(sample_id), path(&quot;${sample_id}_{1,2}.fq.gz&quot;), emit: reads
    tuple val(sample_id), path(&quot;${sample_id}.bbduk.log&quot;), emit: log

    script:
    def (fq1, fq2) = reads

    &quot;&quot;&quot;
    bbduk.sh \\
        in1=&quot;${fq1}&quot; \\
        in2=&quot;${fq2}&quot; \\
        out1=&quot;${sample_id}_1.fq.gz&quot; \\
        out2=&quot;${sample_id}_2.fq.gz&quot; \\
        qtrim=r \\
        trimq=10 \\
        minlen=100 \\
        &amp;&gt; &quot;${sample_id}.bbduk.log&quot;
    &quot;&quot;&quot;
}

workflow {

    reads = Channel.fromFilePairs( params.reads )

    BBduk( reads )
}

Note that by staging the files in a subdirectory (using stageAs: 'reads/*'), we ensure we don't accidentally clobber our input files in the working directory.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Nextflow 没有这样的变量: id

问题

答案1

Nextflow的CPU设置配置文件

why I got Cannot invoke method view() on null object

获取目录通道 nexflow 的文件列表。

How to reuse the same process twice in within the same module in nextflow dsl2, but saving the output in a different name?

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论