Print whole numbers as integers in Pandas LaTeX conversion

huangapple go评论89阅读模式
英文:

Print whole numbers as integers in Pandas LaTeX conversion

问题

有没有办法确保整数始终显示为整数?

翻译好的部分:
"Is there any way to make sure that whole numbers are always displayed as integers?"

英文:

I'm trying to write a small script to print LaTeX tables based on CSV.
A lot of the functionality formerly in e.g. matrix2latex have now been included in Pandas proper, which is cool.

However, no matter what I do (I tried a number of the other suggestions on here, it ended up becoming a convoluted mess which in effect changes nothing), it keeps coming out like this:

  1. [deco]/tmp/table python lala.py
  2. Dataframe:
  3. Unnamed: 0 Treatment Implant Transgenic Untreated Mice Transgenic Treated Mice Wildtype Mice Total Mice
  4. 0 P1 Armodafinil VTA 20+15 20.0 5.0 60
  5. 1 P2 NaN LC 50 NaN 10.0 60
  6. 2 P3 Escitalopram DR 20 20.0 NaN 40
  7. 3 P4 Reboxetine LC 20 20.0 NaN 40
  8. LaTeX Table Conversion:
  9. \begin{tabular}{lllllrrr}
  10. & Unnamed: 0 & Treatment & Implant & Transgenic Untreated Mice & Transgenic Treated Mice & Wildtype Mice & Total Mice \\
  11. 0 & P1 & Armodafinil & VTA & 20+15 & 20.000000 & 5.000000 & 60 \\
  12. 1 & P2 & nan & LC & 50 & nan & 10.000000 & 60 \\
  13. 2 & P3 & Escitalopram & DR & 20 & 20.000000 & nan & 40 \\
  14. 3 & P4 & Reboxetine & LC & 20 & 20.000000 & nan & 40 \\
  15. \end{tabular}
  16. [deco]/tmp/table cat lala.py
  17. import pandas as pd
  18. df = pd.read_csv("table.csv")
  19. print("\n")
  20. print("Dataframe:")
  21. print("")
  22. print(df)
  23. tex = df.style.to_latex()
  24. print("\n")
  25. print("LaTeX Table Conversion:")
  26. print("")
  27. print(tex)
  28. [deco]/tmp/table cat table.csv
  29. ,Treatment,Implant,Transgenic Untreated Mice,Transgenic Treated Mice,Wildtype Mice,Total Mice
  30. P1,Armodafinil,VTA,20+15,20,5,60
  31. P2,N/A,LC,50,,10,60
  32. P3,Escitalopram,DR,20,20,,40
  33. P4,Reboxetine,LC,20,20,,40

Is there any way to make sure that whole numbers are always displayed as integers?

答案1

得分: 1

你似乎遇到的问题是缺失的表格条目被解释为NaN,从而强制整个列变为浮点数,你可以通过使用以下方式防止空条目被解释为NaN并使它们保持为空:

  1. import pandas as pd
  2. df = pd.read_csv("table.csv", keep_default_na=False)
  3. print(df)
  4. tex = df.style.to_latex()
  5. print(tex)

这将导致以下输出:

  1. Unnamed: 0 Treatment Implant Transgenic Untreated Mice Transgenic Treated Mice Wildtype Mice Total Mice
  2. 0 P1 Armodafinil VTA 20+15 20 5 60
  3. 1 P2 N/A LC 50 10 60
  4. 2 P3 Escitalopram DR 20 20 40
  5. 3 P4 Reboxetine LC 20 20 40
  6. \begin{tabular}{lllllllr}
  7. & Unnamed: 0 & Treatment & Implant & Transgenic Untreated Mice & Transgenic Treated Mice & Wildtype Mice & Total Mice \\
  8. 0 & P1 & Armodafinil & VTA & 20+15 & 20 & 5 & 60 \\
  9. 1 & P2 & N/A & LC & 50 & & 10 & 60 \\
  10. 2 & P3 & Escitalopram & DR & 20 & 20 & & 40 \\
  11. 3 & P4 & Reboxetine & LC & 20 & 20 & & 40 \\
  12. \end{tabular}

如果您对行号不满意,您可能希望删除csv文件第一行的第一个逗号,以获得类似以下的输出:

  1. Treatment Implant Transgenic Untreated Mice Transgenic Treated Mice Wildtype Mice Total Mice
  2. P1 Armodafinil VTA 20+15 20 5 60
  3. P2 N/A LC 50 10 60
  4. P3 Escitalopram DR 20 20 40
  5. P4 Reboxetine LC 20 20 40
  6. \begin{tabular}{llllllr}
  7. & Treatment & Implant & Transgenic Untreated Mice & Transgenic Treated Mice & Wildtype Mice & Total Mice \\
  8. P1 & Armodafinil & VTA & 20+15 & 20 & 5 & 60 \\
  9. P2 & N/A & LC & 50 & & 10 & 60 \\
  10. P3 & Escitalopram & DR & 20 & 20 & & 40 \\
  11. P4 & Reboxetine & LC & 20 & 20 & & 40 \\
  12. \end{tabular}
英文:

the issue, that you seem to be facing is that missing table entries are interpreted as NaN, which then forces the entire column to float you can prevent the empty entries from getting read as NaN and have them just left empty by using

  1. import pandas as pd
  2. df = pd.read_csv("table.csv", keep_default_na=False)
  3. print(df)
  4. tex = df.style.to_latex()
  5. print(tex)

this leads to the following output:

  1. Unnamed: 0 Treatment Implant Transgenic Untreated Mice Transgenic Treated Mice Wildtype Mice Total Mice
  2. 0 P1 Armodafinil VTA 20+15 20 5 60
  3. 1 P2 N/A LC 50 10 60
  4. 2 P3 Escitalopram DR 20 20 40
  5. 3 P4 Reboxetine LC 20 20 40
  6. \begin{tabular}{lllllllr}
  7. & Unnamed: 0 & Treatment & Implant & Transgenic Untreated Mice & Transgenic Treated Mice & Wildtype Mice & Total Mice \\
  8. 0 & P1 & Armodafinil & VTA & 20+15 & 20 & 5 & 60 \\
  9. 1 & P2 & N/A & LC & 50 & & 10 & 60 \\
  10. 2 & P3 & Escitalopram & DR & 20 & 20 & & 40 \\
  11. 3 & P4 & Reboxetine & LC & 20 & 20 & & 40 \\
  12. \end{tabular}

if you're dissatisfied with the numbering of the rows you might want to remove the first comma on the first line of your csv to get an output that looks like this:

  1. Treatment Implant Transgenic Untreated Mice Transgenic Treated Mice Wildtype Mice Total Mice
  2. P1 Armodafinil VTA 20+15 20 5 60
  3. P2 N/A LC 50 10 60
  4. P3 Escitalopram DR 20 20 40
  5. P4 Reboxetine LC 20 20 40
  6. \begin{tabular}{llllllr}
  7. & Treatment & Implant & Transgenic Untreated Mice & Transgenic Treated Mice & Wildtype Mice & Total Mice \\
  8. P1 & Armodafinil & VTA & 20+15 & 20 & 5 & 60 \\
  9. P2 & N/A & LC & 50 & & 10 & 60 \\
  10. P3 & Escitalopram & DR & 20 & 20 & & 40 \\
  11. P4 & Reboxetine & LC & 20 & 20 & & 40 \\
  12. \end{tabular}

答案2

得分: 0

你可以在转换为LaTeX之前使用.astype(int)函数格式化整数。此外,你可以使用格式选项来控制LaTeX中所有列的格式:

  1. import pandas as pd
  2. df = pd.read_csv("table.csv")
  3. # 格式化列为整数
  4. df = df.astype({"Transgenic Untreated Mice": int, "Transgenic Treated Mice": int, "Wildtype Mice": int, "Total Mice": int})
  5. # 在LaTeX中将所有列格式化为整数
  6. tex = df.style.format("{:.0f}").to_latex()
  7. print("\n")
  8. print("数据框:")
  9. print("")
  10. print(df)
  11. print("\n")
  12. print("LaTeX表格转换:")
  13. print("")
  14. print(tex)
英文:

You can format the integers using the .astype(int) function before converting to LaTeX. Also, you can use the format option to control the formatting of all columns in LaTeX:

  1. import pandas as pd
  2. df = pd.read_csv("table.csv")
  3. # Format columns as integers
  4. df = df.astype({"Transgenic Untreated Mice": int, "Transgenic Treated Mice": int, "Wildtype Mice": int, "Total Mice": int})
  5. # Format all columns as integers in LaTeX
  6. tex = df.style.format("{:.0f}").to_latex()
  7. print("\n")
  8. print("Dataframe:")
  9. print("")
  10. print(df)
  11. print("\n")
  12. print("LaTeX Table Conversion:")
  13. print("")
  14. print(tex)

答案3

得分: 0

以下是翻译好的部分:

"A more general way to do that would be to convert the missing values to an integer (if possible) and to convert the datatype using convert_dtypes."

更一般的做法是将缺失值转换为整数(如果可能的话),并使用 convert_dtypes 转换数据类型。

"The same works without replacing nan values, convert_dtypes then replaces the datatype to Int64."

不替换NaN值的情况下也适用,convert_dtypes 然后将数据类型替换为Int64。

  1. import pandas as pd
  2. df = pd.read_csv("table.csv")
  3. print("\n")
  4. print("Dataframe:")
  5. print("")
  6. print(df)
  7. tex = df.fillna(-1).convert_dtypes().to_latex()
  8. print("\n")
  9. print("LaTeX Table Conversion:")
  10. print("")
  11. print(tex)
  1. LaTeX 表格转换:
  2. \begin{tabular}{lllllrrr}
  3. \toprule
  4. {} & Unnamed: 0 & Treatment & Implant & Transgenic Untreated Mice & Transgenic Treated Mice & Wildtype Mice & Total Mice \\
  5. \midrule
  6. 0 & P1 & Armodafinil & VTA & 20+15 & 20 & 5 & 60 \\
  7. 1 & P2 & -1 & LC & 50 & -1 & 10 & 60 \\
  8. 2 & P3 & Escitalopram & DR & 20 & 20 & -1 & 40 \\
  9. 3 & P4 & Reboxetine & LC & 20 & 20 & -1 & 40 \\
  10. \bottomrule
  11. \end{tabular}
英文:

A more general way to do that would be to convert the missing values to an integer (if possible) and to convert the datatype using convert_dtypes
.

The same works without replacing nan values, convert_dtypes then replaces the datatype to Int64.

  1. import pandas as pd
  2. df = pd.read_csv("table.csv")
  3. print("\n")
  4. print("Dataframe:")
  5. print("")
  6. print(df)
  7. tex = df.fillna(-1).convert_dtypes().to_latex()
  8. print("\n")
  9. print("LaTeX Table Conversion:")
  10. print("")
  11. print(tex)
  1. LaTeX Table Conversion:
  2. \begin{tabular}{lllllrrr}
  3. \toprule
  4. {} & Unnamed: 0 & Treatment & Implant & Transgenic Untreated Mice & Transgenic Treated Mice & Wildtype Mice & Total Mice \\
  5. \midrule
  6. 0 & P1 & Armodafinil & VTA & 20+15 & 20 & 5 & 60 \\
  7. 1 & P2 & -1 & LC & 50 & -1 & 10 & 60 \\
  8. 2 & P3 & Escitalopram & DR & 20 & 20 & -1 & 40 \\
  9. 3 & P4 & Reboxetine & LC & 20 & 20 & -1 & 40 \\
  10. \bottomrule
  11. \end{tabular}

huangapple
  • 本文由 发表于 2023年2月6日 06:57:26
  • 转载请务必保留本文链接:https://go.coder-hub.com/75356081.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定