2023-03-07 10:36:46

Odd result when ingesting a DataFrame into InfluxDB 2.x

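Question

You tested the official example code and successfully wrote the data to InfluxDB, but you cannot see the data on the dashboard. This may be a dashboard configuration issue: the dashboard has to use the correct query to display the data. Check the dashboard's query settings and make sure the query uses the correct bucket, measurement, and field. Also make sure the dashboard's time range covers the timestamps of the data you wrote; otherwise the data will not appear on the dashboard.
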
I'm using Python to ingest data into InfluxDB, but many examples simply don't work (the table is empty, yet no error is reported), and one example gives an odd result.

With the client set up, I create a DataFrame like this:

import pandas as pd
from datetime import datetime
from datetime import timedelta
_now = datetime.utcnow()
_data_frame = pd.DataFrame(data=[["coyote_creek", 1.0], ["coyote_creek", 2.0]],
                           index=[_now, _now + timedelta(hours=1)],
                           columns=["location", "water_level"])

The resulting data looks like this:

_data_frame
Out[24]: 
                                location  water_level
2023-03-07 02:04:11.642867  coyote_creek          1.0
2023-03-07 03:04:11.642867  coyote_creek          2.0

So I should have two rows after ingesting. However, when I ingest the data with:

write_api.write("testing_for_dataframe", "org", record=_data_frame, data_frame_measurement_name='h2o_feet',
                            data_frame_tag_columns=['location'])

I get only one row back when I query with:

query_api.query_data_frame('from(bucket:"testing_for_dataframe") |> range(start: -10m)')

The result is:

Out[31]: 
    result  table  ... _measurement      location
0  _result      0  ...     h2o_feet  coyote_creek
[1 rows x 9 columns]

By contrast, this example works without any problem:

from influxdb_client import InfluxDBClient, Point, Dialect
from influxdb_client.client.write_api import SYNCHRONOUS
client = InfluxDBClient(url="http://localhost:8086", token="my-token", org="my-org")
write_api = client.write_api(write_options=SYNCHRONOUS)
query_api = client.query_api()
"""
Prepare data
"""
_point1 = Point("my_measurement").tag("location", "Prague").field("temperature", 25.3)
_point2 = Point("my_measurement").tag("location", "New York").field("temperature", 24.3)
write_api.write(bucket="my-bucket", record=[_point1, _point2])
"""
Query: using Pandas DataFrame
"""
data_frame = query_api.query_data_frame('from(bucket:"my-bucket") '
                                        '|> range(start: -10m) '
                                        '|> pivot(rowKey:["_time"], columnKey: ["_field"], valueColumn: "_value") '
                                        '|> keep(columns: ["location", "temperature"])')
print(data_frame.to_string())
"""
Close client
"""
client.close()

I'm not familiar with the InfluxDB query syntax, but that is not the key point, since I also checked the dashboard, and it shows only one row of data as well.

[screenshot of the dashboard]

MY PROBLEM IS: "Sometimes none of the data is ingested and only the named table is created, with an empty result. Other times only part of the data is ingested and the rest is lost without any error. And since some examples work correctly, I guess it is not caused by some other setup issue."

I want to ingest a DataFrame and see it in the dashboard (which means it can also be queried from Python).

I have now tested the official example:

import pandas as pd
from influxdb_client import InfluxDBClient
from influxdb_client.client.write_api import SYNCHRONOUS, PointSettings
"""
Load DataFrame from CSV File
"""
df = pd.read_csv("vix-daily.csv")
print(df.head())
with InfluxDBClient(url="http://localhost:8086", token="my-token", org="my-org") as client:
    """
    Ingest DataFrame with default tags
    """
    point_settings = PointSettings(**{"type": "vix-daily"})
    point_settings.add_default_tag("example-name", "ingest-data-frame")
    write_api = client.write_api(write_options=SYNCHRONOUS, point_settings=point_settings)
    write_api.write(bucket="my-bucket", record=df, data_frame_measurement_name="financial-analysis-df")
    """
    Querying ingested data
    """
    query = 'from(bucket:"my-bucket")' \
            ' |> range(start: 0, stop: now())' \
            ' |> filter(fn: (r) => r._measurement == "financial-analysis-df")' \
            ' |> pivot(rowKey:["_time"], columnKey: ["_field"], valueColumn: "_value")' \
            ' |> limit(n:10, offset: 0)'
    result = client.query_api().query(query=query)
    """
    Processing results
    """
    print()
    print("=== results ===")
    print()
    for table in result:
        for record in table.records:
            print('{4}: Open {0}, Close {1}, High {2}, Low {3}'.format(record["VIX Open"], record["VIX Close"],
                                                                       record["VIX High"], record["VIX Low"],
                                                                       record["type"]))

And I get:

         Date  VIX Open  VIX High  VIX Low  VIX Close
0  2004-01-02     17.96     18.68    17.54      18.22
1  2004-01-05     18.45     18.49    17.44      17.49
2  2004-01-06     17.66     17.67    16.19      16.73
3  2004-01-07     16.72     16.75    15.50      15.50
4  2004-01-08     15.42     15.68    15.32      15.61
=== results ===
vix-daily: Open 17.96, Close 18.22, High 18.68, Low 17.54
vix-daily: Open 18.45, Close 17.49, High 18.49, Low 17.44
vix-daily: Open 17.66, Close 16.73, High 17.67, Low 16.19
vix-daily: Open 16.72, Close 15.5, High 16.75, Low 15.5
vix-daily: Open 15.42, Close 15.61, High 15.68, Low 15.32
vix-daily: Open 16.15, Close 16.75, High 16.88, Low 15.57
vix-daily: Open 17.32, Close 16.82, High 17.46, Low 16.79
vix-daily: Open 16.6, Close 18.04, High 18.33, Low 16.53
vix-daily: Open 17.29, Close 16.75, High 17.3, Low 16.4
vix-daily: Open 17.07, Close 15.56, High 17.31, Low 15.49

BUT I STILL CANNOT SEE IT IN THE DASHBOARD. Why?

The data can be queried, but it is not visible in the dashboard.

Answer 1

Score: 0

I think it all comes down to how you are filtering data by timestamp.

In the first example, you create two data points: one with the timestamp "now" (02:04:11) and one with a timestamp one hour in the future (03:04:11). In your query through the Python client, you ask for the last ten minutes of data counted back from "now" (range(start: -10m)), so you are querying data between 01:54:11 and 02:04:11. Only the first data point is returned, because the second one lies outside this time window. On the dashboard, on the other hand, you ask for the last hour of data (see the Time Range filter in the dropdown menu to the left of the "script editor" button). You ran that query a while after launching your script, so only data between 01:16:15 and 02:16:15 was queried, which again excludes the second data point.
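
The window arithmetic above can be reproduced offline with pandas, without a running InfluxDB instance. This is only a sketch: `rows_in_window` is a hypothetical helper that mimics the start-inclusive, stop-exclusive behaviour of Flux `range()`, the timestamps are copied from the question and assumed to be UTC, and the query time (five seconds after the first point) is made up for illustration.

```python
from datetime import datetime, timedelta, timezone

import pandas as pd

# Timestamps taken from the question's output (assumed to be UTC).
first = datetime(2023, 3, 7, 2, 4, 11, 642867, tzinfo=timezone.utc)
frame = pd.DataFrame(
    data=[["coyote_creek", 1.0], ["coyote_creek", 2.0]],
    index=[first, first + timedelta(hours=1)],
    columns=["location", "water_level"],
)


def rows_in_window(df, now, lookback):
    """Return rows whose timestamp t satisfies now - lookback <= t < now."""
    start = now - lookback
    mask = (df.index >= start) & (df.index < now)
    return df[mask]


# Hypothetical query time: a few seconds after the write.
query_time = first + timedelta(seconds=5)

# Equivalent of range(start: -10m) and range(start: -1h) at query time.
last_10m = rows_in_window(frame, query_time, timedelta(minutes=10))
last_1h = rows_in_window(frame, query_time, timedelta(hours=1))

# The same ten-minute query, run one hour later.
last_10m_later = rows_in_window(
    frame, query_time + timedelta(hours=1), timedelta(minutes=10)
)

print(len(last_10m), len(last_1h), len(last_10m_later))  # → 1 1 1
```

Both relative windows end at the query time, so the point stamped one hour ahead is excluded in either case. Running the ten-minute query an hour later returns only the second point, so seeing both rows at once would need a range whose start precedes the first timestamp and whose explicit stop lies after the second.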

I'm not sure about the second example, since I have too little information about it, but you will probably need to adjust the Time Range filter to include the 2004 data.
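
One way to find out which time range a dashboard has to cover is to inspect the timestamps before writing. The sketch below rests on assumptions: it treats the `Date` column as the intended timestamp (interpreted as UTC) and moves it into the index, since the index is what supplied the timestamps in the first example. It does not verify what timestamps the official example actually wrote. `flux_range` is a hypothetical helper, and the CSV text is just the first rows printed in the question.

```python
import io
from datetime import timedelta

import pandas as pd

# First rows of vix-daily.csv as printed in the question.
csv_text = """Date,VIX Open,VIX High,VIX Low,VIX Close
2004-01-02,17.96,18.68,17.54,18.22
2004-01-05,18.45,18.49,17.44,17.49
2004-01-06,17.66,17.67,16.19,16.73
"""

df = pd.read_csv(io.StringIO(csv_text))

# Assumption: "Date" is the intended timestamp, interpreted as UTC.
df["Date"] = pd.to_datetime(df["Date"], utc=True)
df = df.set_index("Date")


def flux_range(index, padding=timedelta(days=1)):
    """Build a Flux range() call with absolute bounds covering the index."""
    start = (index.min() - padding).strftime("%Y-%m-%dT%H:%M:%SZ")
    stop = (index.max() + padding).strftime("%Y-%m-%dT%H:%M:%SZ")
    return f"range(start: {start}, stop: {stop})"


range_call = flux_range(df.index)
print(range_call)
# → range(start: 2004-01-01T00:00:00Z, stop: 2004-01-07T00:00:00Z)
```

The same bounds can be entered as a custom time range in the dashboard, or the generated `range()` call can replace a relative one such as `range(start: -10m)` in a query.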
