如何在数据重叠时追加数据集?

huangapple go评论55阅读模式
英文:

How to append data sets when there is an overlap in data?

问题

我有两个数据集,它们都具有相同的列名。其中一个数据集是一个Excel文件,包含截止到四月的历史数据。第二个数据集链接到数据库,但始终只包含最近6个月的数据。我应该如何将这两个数据集追加在一起,以确保删除重复项?基本上,我想优先使用实时数据源的数据,但也确保数据不会丢失。

我的担忧是:在14个月后,五月份的数据会发生什么情况,因为它不在当前的Excel数据集中,也不会在实时数据集中出现?如何确保在14个月后五月份的数据不会消失?

英文:

I have two data sets and they both have the same column names. One of the data sets is an excel file and contains historical data up until April. The second data set is linked to a database but will only ever contain 6 rolling months' worth of data. How would I go about appending the two data sets, by ensuring that duplicates are removed? Basically, I'd like to prioritise the data from the live source, but also ensure that the data doesn't get lost.

My concern is: what will happen to May data in 14 months from now, since it is not in the current excel data set, and it will not be in the live data set? How do I ensure that May data doesn't disappear in 14 months' time?

答案1

得分: 1

Power Query 不支持历史数据。如果您希望保留这些数据,必须实现一些存储滚动数据的解决方案,以确保它不会“滚动”掉,可以这样说。

英文:

Power Query does not support historical data. If you want to preserve this data you must implement some solution that stores the rolling data so it doesn’t «roll off», so to speak.

huangapple
  • 本文由 发表于 2023年6月22日 13:10:44
  • 转载请务必保留本文链接:https://go.coder-hub.com/76528755.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定