2023年3月23日 10:53:51go评论128阅读模式

英文:

How can I get Streamlit to display years in data frames without a comma?

问题

I am creating a Streamlit app for a final project for school. It contains two raw data frames and two graphs. However, when I post the data frames to the app, the Year columns come out with commas, i.e. 1,993 instead of 1993.

So far, I've tried saving the cleaned data with the Year columns set as int and also as objects--didn't work. I've also tried saving the cleaned data as a .csv to load into my Streamlit code instead of a .xlsx, in case there was something funky with the Excel format that caused the commas to appear--this also didn't work. I expected for the data frames to be posted to the Streamlit app in a YYYY format as opposed to a Y,YYY format, but I got the Y,YYY format instead. In the end, I used matplotlib to post the graphs since it doesn't add unnecessary commas.

This is what my streamlit code looks like:

import pandas as pd
import matplotlib.pyplot as plt
import streamlit as st
st.title('Global Biodiversity Decline')
st.write(' ')
st.write(' ')
st.write(' ')
live = pd.read_excel('living-planet-spread.xlsx')
live = live.drop(axis=1, columns='Unnamed: 0')
live['Year'] = live['Year'].astype('object')
live2 = pd.pivot_table(live, index='Year', columns='Region', values='Average Index', fill_value=0)
st.subheader('Decline of Average Index by Year')
if st.checkbox('Show Raw Biodiversity Data'):
    st.subheader('Raw Data')
    st.write(live2)
    st.caption("Data Source: World Wildlife Fund (WWF) and Zoological Society of London")
chart = pd.DataFrame(live2, columns=['Africa', 'Asia', 'Europe', 'South America', 'North America', 'World'])
fig, ax = plt.subplots(figsize=(12, 6))
ax.plot(chart)
ax.set(xlabel='Year', ylabel='Index (%)')
ax.legend(['Africa', 'Asia', 'Europe', 'South America', 'North America'])
st.pyplot(fig)
st.caption('Above is a graph plotting the average index of biodiversity per region. Note that all regions are on a steady decline, particularly Latin America which has a sharper decline than all other regions. One possible cause of this could be deforestation related to farming. See the below graph.')
st.write(' ')
st.write(' ')
st.write(' ')
# I had to set the index as 'Year' in order for the x-axis of this graph to show up as the Years instead of a numbered index
land = pd.read_excel('fao_land_data_spread.xlsx')
land = land.set_index('Year')
st.subheader('Regional Increase in Land Use for Farming by Year')
if st.checkbox('Show Raw Land Area Data'):
    st.subheader('Raw Data')
    st.write(land)
    st.caption('Data Source: UNData')
chart2 = pd.DataFrame(land, columns=['Africa', 'Asia', 'Europe', 'South America', 'North America'])
chart3 = pd.DataFrame(land, columns=['World'])
fig, ax = plt.subplots(figsize=(12, 6))
ax.plot(chart2)
ax.set(xlabel='Year', ylabel='Area (1000 Ha)e+06')
ax.legend(['Africa', 'Asia', 'Europe', 'South America', 'North America'])
st.pyplot(fig)
st.caption('Above is a graph plotting the area of farmland used per region...')
st.write(' ')
st.write(' ')
st.write(' ')
st.subheader('Global Increase in Land Use for Farming by Year')
fig, ax = plt.subplots(figsize=(12, 6))
ax.plot(chart3)
ax.set(xlabel='Year', ylabel='Area (1000 Ha)e+06')
st.pyplot(fig)
st.caption('I put the Global area of farmland in its own graph...')

And this is a sample of what each data frame looks like:

	Africa	Asia	Europe	North America	South America	World
Year						
1961	927526.222222	911930.555556	825966.444444	586216.444444	502466.333333	4.146173e+06
1962	927657.000000	913559.333333	826292.888889	585067.666667	503954.444444	4.149369e+06
1963	928080.888889	914962.222222	825754.111111	584786.000000	505403.444444	4.152637e+06
1964	928313.333333	916675.333333	825170.777778	584079.000000	506533.333333	4.155457e+06
1965	928717.111111	918125.555556	825569.555556	583276.444444	507664.888889	4.159057e+06

Region	 Year	Average Index	Upper Index	Lower Index
44	Africa	2014	32.492869	68.628636	15.238575
45	Africa	2015	31.293573	66.256152	14.669147
46	Africa	2016	32.054221	68.026893	14.968882
47	Africa	2017	34.445875	73.433580	15.991854
48	Africa	2018	34.445875	73.433580	15.991854

英文:

This is what my streamlit code looks like:

import pandas as pd
import matplotlib.pyplot as plt
import streamlit as st
st.title(&#39;Global Biodiversity Decline&#39;)
st.write(&#39; &#39;)
st.write(&#39; &#39;)
st.write(&#39; &#39;)
live=pd.read_excel(&#39;living-planet-spread.xlsx&#39;)
live=live.drop(axis=1, columns=&#39;Unnamed: 0&#39;)
live[&#39;Year&#39;]=live[&#39;Year&#39;].astype(&#39;object&#39;)
live2=pd.pivot_table(live, index=&#39;Year&#39;, columns=&#39;Region&#39;, values=&#39;Average Index&#39;, fill_value=0)
st.subheader(&#39;Decline of Average Index by Year&#39;)
if st.checkbox(&#39;Show Raw Biodiversity Data&#39;):
st.subheader(&#39;Raw Data&#39;)
st.write(live2)
st.caption(&quot;Data Source: World Wildlife Fund (WWF) and Zoological Society of London&quot;)
chart=pd.DataFrame(live2, columns=[&#39;Africa&#39;, &#39;Asia and Pacific&#39;, &#39;Europe and Central Asia&#39;, &#39;Latin America and the Carribean&#39;, &#39;North America&#39;, &#39;World&#39;])
fig, ax=plt.subplots(figsize=(12,6))
ax.plot(chart)
ax.set(xlabel=&#39;Year&#39;, ylabel=&#39;Index (%)&#39;)
ax.legend([&#39;Africa&#39;, &#39;Asia&#39;, &#39;Europe&#39;, &#39;South America&#39;, &#39;North America&#39;])
st.pyplot(fig)
st.caption(&#39;Above is a graph plotting the average index of biodiversity per region. Note that all regions are on a steady decline, particularly Latin America which has a sharper decline than all other regions. One possible cause of this could be deforestation related to farming. See the below graph.&#39;)
st.write(&#39; &#39;)
st.write(&#39; &#39;)
st.write(&#39; &#39;)
#I had to set the index as &#39;Year&#39; in order for the x-axis of this graph to show up as the Years instead of a numbered index
land=pd.read_excel(&#39;fao_land_data_spread.xlsx&#39;)
land=land.set_index(&#39;Year&#39;)
st.subheader(&#39;Regional Increase in Land Use for Farming by Year&#39;)
if st.checkbox(&#39;Show Raw Land Area Data&#39;):
st.subheader(&#39;Raw Data&#39;)
st.write(land)
st.caption(&#39;Data Source: UNData&#39;)
chart2=pd.DataFrame(land, columns=[&#39;Africa&#39;, &#39;Asia&#39;, &#39;Europe&#39;, &#39;South America&#39;, &#39;North America&#39;])
chart3=pd.DataFrame(land, columns=[&#39;World&#39;])
fig, ax=plt.subplots(figsize=(12,6))
ax.plot(chart2)
ax.set(xlabel=&#39;Year&#39;, ylabel=&#39;Area (1000 Ha)e+06&#39;)
ax.legend([&#39;Africa&#39;, &#39;Asia&#39;, &#39;Europe&#39;, &#39;South America&#39;, &#39;North America&#39;])
st.pyplot(fig)
st.caption(&#39;Above is a graph plotting the area of farmland used per region...&#39;)
st.write(&#39; &#39;)
st.write(&#39; &#39;)
st.write(&#39; &#39;)
st.subheader(&#39;Global Increase in Land Use for Farming by Year&#39;)
fig, ax=plt.subplots(figsize=(12,6))
ax.plot(chart3)
ax.set(xlabel=&#39;Year&#39;, ylabel=&#39;Area (1000 Ha)e+06&#39;)
st.pyplot(fig)
st.caption(&#39;I put the Global area of farmland in its own graph...&#39;)

And this is a sample of what each data frame looks like:

	Africa	Asia	Europe	North America	South America	World
Year						
1961	927526.222222	911930.555556	825966.444444	586216.444444	502466.333333	4.146173e+06
1962	927657.000000	913559.333333	826292.888889	585067.666667	503954.444444	4.149369e+06
1963	928080.888889	914962.222222	825754.111111	584786.000000	505403.444444	4.152637e+06
1964	928313.333333	916675.333333	825170.777778	584079.000000	506533.333333	4.155457e+06
1965	928717.111111	918125.555556	825569.555556	583276.444444	507664.888889	4.159057e+06

Region	 Year	Average Index	Upper Index	Lower Index
44	Africa	2014	32.492869	68.628636	15.238575
45	Africa	2015	31.293573	66.256152	14.669147
46	Africa	2016	32.054221	68.026893	14.968882
47	Africa	2017	34.445875	73.433580	15.991854
48	Africa	2018	34.445875	73.433580	15.991854

答案1

得分: 0

从您的描述和代码片段来看，逗号是因为从Excel中读取Year列时将其读取为数字类型引起的。在将数字类型转换为对象类型时似乎引入了逗号，这似乎是Pandas Excel读取器的默认行为。

您可以尝试将Year的数据类型指定为字符串，然后将其转换回数字或整数，如下所示：

live = pd.read_excel('living-planet-spread.xlsx', dtype={'Year': str})
# 将"Year"列转换为数字
live['Year'] = pd.to_numeric(live['Year'])
# 将"Year"列转换为整数
live['Year'] = live['Year'].astype(int)

英文:

From your description and code snippet, it seems the comma is caused by the Year column being read in as a numeric type from Excel. The comma seems to be introduced when converting the numeric type to an object type which seems to be a default behavior of Pandas excel reader.

You can try specifying the data type of Year as a String then convert it back to numeric or int as such:

live=pd.read_excel(&#39;living-planet-spread.xlsx&#39;, dtype={&#39;Year&#39;: str})
# Convert the &quot;Year&quot; column to a numeric 
live[&#39;Year&#39;] = pd.to_numeric(live[&#39;Year&#39;])
# Convert the &quot;Year&quot; column to an integer
live[&#39;Year&#39;] = live[&#39;Year&#39;].astype(int)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何使Streamlit在数据框中显示年份时不带逗号？

问题

答案1

Python subprocess (using Popen) hangs indefinitely and doesn't terminate when the script ends. Why, and how to fix?

无法安装 YOLOX。

How to register pandas dataframe as parquet or csv dataset in the container and in the Data at the same time?

将HF模型推送到Hub。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。