问题

以下是您要翻译的内容：

import csv

# Specify the input and output file names
input_file = 'influx.csv'
output_file = 'output.csv'

try:
    # Open the input file for reading
    with open(input_file, 'r') as csv_file:
        # Create a CSV reader object
        csv_reader = csv.reader(csv_file)

        # Skip the first row (header)
        next(csv_reader)

        # Open the output file for writing
        with open(output_file, 'w', newline='') as output_csv:
            # Create a CSV writer object
            csv_writer = csv.writer(output_csv)

            # Write the header row
            csv_writer.writerow(['_time', '_field', '_value'])

            # Iterate over the input file and write the rows to the output file
            for row in csv_reader:
                # Check if the row is not empty
                if row:
                    # Split the fields
                    fields = row[0].split(',')

                    # Write the row to the output file
                    csv_writer.writerow(fields)

    print(f'{input_file} converted to {output_file} successfully!')

except FileNotFoundError:
    print(f'Error: File {input_file} not found.')

except Exception as e:
    print(f'Error: {e}')

如果您需要任何其他翻译，请告诉我。

英文:

I have a CSV file that was downloaded from InfluxDB UI. I want to extract useful data from the downloaded file. A snippet of the downloaded file is as follows:

#group	FALSE	FALSE	TRUE	TRUE	FALSE	FALSE	TRUE	TRUE	TRUE	TRUE	TRUE
#datatype	string	long	dateTime:RFC3339	dateTime:RFC3339	dateTime:RFC3339	double	string	string	string	string	string
#default	mean										
	result	table	_start	_stop	_time	_value	_field	_measurement	smart_module	serial	type
		0	2023-03-31T08:12:40.697076925Z	2023-03-31T09:12:40.697076925Z	2023-03-31T08:20:00Z	0	sm_alarm	system_test	8	2.14301E+11	sm_extended
		0	2023-03-31T08:12:40.697076925Z	2023-03-31T09:12:40.697076925Z	2023-03-31T08:40:00Z	0	sm_alarm	system_test	8	2.14301E+11	sm_extended
		0	2023-03-31T08:12:40.697076925Z	2023-03-31T09:12:40.697076925Z	2023-03-31T09:00:00Z	0	sm_alarm	system_test	8	2.14301E+11	sm_extended
		0	2023-03-31T08:12:40.697076925Z	2023-03-31T09:12:40.697076925Z	2023-03-31T09:12:40.697076925Z	0	sm_alarm	system_test	8	2.14301E+11	sm_extended

I'd like to have the output CSV as follows:

_time                   sm_alarm  next_column next_column ....... ...........
2023-03-29T08:41:15Z    0

Please note that sm_alarm is only one field among 9 others (that are under _filed).

I tried to do with the following script, but could not solve my problem.

import csv

# Specify the input and output file names
input_file = &#39;influx.csv&#39;
output_file = &#39;output.csv&#39;

try:
    # Open the input file for reading
    with open(input_file, &#39;r&#39;) as csv_file:
        # Create a CSV reader object
        csv_reader = csv.reader(csv_file)

        # Skip the first row (header)
        next(csv_reader)

        # Open the output file for writing
        with open(output_file, &#39;w&#39;, newline=&#39;&#39;) as output_csv:
            # Create a CSV writer object
            csv_writer = csv.writer(output_csv)

            # Write the header row
            csv_writer.writerow([&#39;_time&#39;, &#39;_field&#39;, &#39;_value&#39;])

            # Iterate over the input file and write the rows to the output file
            for row in csv_reader:
                # Check if the row is not empty
                if row:
                    # Split the fields
                    fields = row[0].split(&#39;,&#39;)

                    # Write the row to the output file
                    csv_writer.writerow(fields)

    print(f&#39;{input_file} converted to {output_file} successfully!&#39;)

except FileNotFoundError:
    print(f&#39;Error: File {input_file} not found.&#39;)

except Exception as e:
    print(f&#39;Error: {e}&#39;)

Thank you.

答案1

得分: 1

以下是翻译好的部分：

import pandas as pd

with open("influx.csv", "r") as csv_file:
    headers = csv_file.readlines()[3].strip().split()[1:]
    
df = pd.read_csv("influx.csv", header=None, skiprows=4, sep="\s+",
                 engine="python", names=headers).iloc[:, 1:]

#print(df)

print(df)

                               _start                           _stop                           _time  _value    _field _measurement  smart_module        serial         type
    0  2023-03-31T08:12:40.697076925Z  2023-03-31T09:12:40.697076925Z            2023-03-31T08:20:00Z       0  sm_alarm  system_test             8  2.143010e+11  sm_extended
    1  2023-03-31T08:12:40.697076925Z  2023-03-31T09:12:40.697076925Z            2023-03-31T08:40:00Z       0  sm_alarm  system_test             8  2.143010e+11  sm_extended
    2  2023-03-31T08:12:40.697076925Z  2023-03-31T09:12:40.697076925Z            2023-03-31T09:00:00Z       0  sm_alarm  system_test             8  2.143010e+11  sm_extended
    3  2023-03-31T08:12:40.697076925Z  2023-03-31T09:12:40.697076925Z  2023-03-31T09:12:40.697076925Z       0  sm_alarm  system_test             8  2.143010e+11  sm_extended

英文:

The format of your expected output is ambiguous and not fully clear.
But as a starting point, you can straighten your file with read_csv from [tag:pandas] this way :

import pandas as pd

with open(&quot;influx.csv&quot;, &quot;r&quot;) as csv_file:
    headers = csv_file.readlines()[3].strip().split()[1:]
    
df = pd.read_csv(&quot;influx.csv&quot;, header=None, skiprows=4, sep=&quot;\s+&quot;,
                 engine=&quot;python&quot;, names=headers).iloc[:, 1:]

#df.to_csv(&quot;output.csv&quot;, index=False, sep=&quot;,&quot;) # &lt;- uncomment this line to make a real csv

Output :

print(df)

                           _start                           _stop                           _time  _value    _field _measurement  smart_module        serial         type
0  2023-03-31T08:12:40.697076925Z  2023-03-31T09:12:40.697076925Z            2023-03-31T08:20:00Z       0  sm_alarm  system_test             8  2.143010e+11  sm_extended
1  2023-03-31T08:12:40.697076925Z  2023-03-31T09:12:40.697076925Z            2023-03-31T08:40:00Z       0  sm_alarm  system_test             8  2.143010e+11  sm_extended
2  2023-03-31T08:12:40.697076925Z  2023-03-31T09:12:40.697076925Z            2023-03-31T09:00:00Z       0  sm_alarm  system_test             8  2.143010e+11  sm_extended
3  2023-03-31T08:12:40.697076925Z  2023-03-31T09:12:40.697076925Z  2023-03-31T09:12:40.697076925Z       0  sm_alarm  system_test             8  2.143010e+11  sm_extended

If you share a clear expected ouptut, I'll update my answer accordingly.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Transforming annotated csv (influxdb) to normal csv file using python script

问题

答案1

Polars – ComputeError: 从NumPy数组转换后无法将类型转换为’Object’类型

Python分离数值

如何在Go应用程序中创建多个Python实例

Python 插入到 JSON

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论