2023年5月11日 18:48:11go评论66阅读模式

英文:

How to split a field and then to print the last element using awk

问题

I am trying to edit a file which has this format:

field1 field2 field3 gene_id "xxxxx"; transcript_id "XM_xxxxxxxx.x"; db_xref "GeneID:102885392"; exon_number "1";

I would like as output:

field1 field2 field3 exon_number "1";

I am using awk to do it, but I failed to print the last part of the last field after splitting it. Here is my code:

awk '{split($4,a,";"); print ($1, $2,$3, a[$NF])}' input

I know a[$NF] is not working, but how to indicate the last subfield; is it the last element of the array? (In my file exon_number is not always the 5th element, but always the last one).

英文:

I am trying to edit a file which has this format:

field1 field2 field3 gene_id &quot;xxxxx&quot;; transcript_id &quot;XM_xxxxxxxx.x&quot;; db_xref &quot;GeneID:102885392&quot;; exon_number &quot;1&quot;;

I would like as output:

field1 field2 field3 exon_number &quot;1&quot;;

I am using awk to do it, but I failed to print the last part of the last field after splitting it. Here is my code:

awk &#39;{split($4,a,&quot;;&quot;); print ($1, $2,$3, a[$NF])}&#39; input

I know a[$NF] is not working, but how to indicate the last subfield; is it the last element of the array? (In my file exon_number is not always the 5th element, but always the last one).

答案1

得分: 3

exon_number "1" 是你第二个最后一个以 ; 分隔的子字段，而不是最后一个，因为最后一个 ; 后面是一个空字符串，你正在进行拆分。

awk 'BEGIN{FS=OFS="\t"} {n=split($4,a,/[[:space:]]*;[[:space:]]*/); print $1, $2, $3, a[n-1]";"}' input

或者：

awk 'BEGIN{FS=OFS="\t"} {n=split($4,a,/[[:space:]]*;[[:space:]]*/); $4=a[n-1]";"; print}' input

在 https://www.gnu.org/software/gawk/manual/gawk.html#String-Functions 上查看 split()。

英文:

exon_number "1" is your 2nd-last ;-separated subfield, not your last one since there's a null string after the last ; you're splitting on.

awk &#39;BEGIN{FS=OFS=&quot;\t&quot;} {n=split($4,a,/[[:space:]]*;[[:space:]]*/); print $1, $2, $3, a[n-1]&quot;;&quot;}&#39; input

or:

awk &#39;BEGIN{FS=OFS=&quot;\t&quot;} {n=split($4,a,/[[:space:]]*;[[:space:]]*/); $4=a[n-1]&quot;;&quot;; print}&#39; input

See split() at https://www.gnu.org/software/gawk/manual/gawk.html#String-Functions

答案2

得分: 1

$ STR='field1<\t>field2<\t>field3<\t>gene_id "xxxxx"; transcript_id "XM_xxxxxxxx.x"; db_xref "GeneID:102885392"; exon_number "1";'

$ awk -F'; ' '{sub(/>[^>]*$/,">",$1); $0=$1 $NF}1' <<<"$STR"
field1<\t>field2<\t>field3<\t>exon_number "1";

英文:

$ STR=&#39;field1&lt;\t&gt;field2&lt;\t&gt;field3&lt;\t&gt;gene_id &quot;xxxxx&quot;; transcript_id &quot;XM_xxxxxxxx.x&quot;; db_xref &quot;GeneID:102885392&quot;; exon_number &quot;1&quot;;&#39;    

$ awk -F&#39;; &#39; &#39;{sub(/&gt;[^&gt;]*$/,&quot;&gt;&quot;,$1); $0=$1 $NF}1&#39; &lt;&lt;&lt;&quot;$STR&quot;
field1&lt;\t&gt;field2&lt;\t&gt;field3&lt;\t&gt;exon_number &quot;1&quot;;

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

怎么分割一个字段，然后用awk打印最后一个元素

问题

答案1

答案2

Get python-like split() working with go's strings.Split()

匹配包含来自另一个文件的两个字符串的行。

如何改进Golang中的分割逻辑

awk 打印除最后一列外的所有列 + 最后一列

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论