英文:
How to append data sets when there is an overlap in data?
问题
我有两个数据集,它们都具有相同的列名。其中一个数据集是一个Excel文件,包含截止到四月的历史数据。第二个数据集链接到数据库,但始终只包含最近6个月的数据。我应该如何将这两个数据集追加在一起,以确保删除重复项?基本上,我想优先使用实时数据源的数据,但也确保数据不会丢失。
我的担忧是:在14个月后,五月份的数据会发生什么情况,因为它不在当前的Excel数据集中,也不会在实时数据集中出现?如何确保在14个月后五月份的数据不会消失?
英文:
I have two data sets and they both have the same column names. One of the data sets is an excel file and contains historical data up until April. The second data set is linked to a database but will only ever contain 6 rolling months' worth of data. How would I go about appending the two data sets, by ensuring that duplicates are removed? Basically, I'd like to prioritise the data from the live source, but also ensure that the data doesn't get lost.
My concern is: what will happen to May data in 14 months from now, since it is not in the current excel data set, and it will not be in the live data set? How do I ensure that May data doesn't disappear in 14 months' time?
答案1
得分: 1
Power Query 不支持历史数据。如果您希望保留这些数据,必须实现一些存储滚动数据的解决方案,以确保它不会“滚动”掉,可以这样说。
英文:
Power Query does not support historical data. If you want to preserve this data you must implement some solution that stores the rolling data so it doesn’t «roll off», so to speak.
通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库,让每个人都能够通过互相帮助和分享经验来进步。
评论