英文:
BTYD: Prior model tweaking
问题
我最近在使用BTYD时遇到了一个挑战,特别是在Pareto-NBD模型方面。从我阅读的Faders的论文中,使用这个模型有一些假设,首要假设是:
i) 客户在他们与特定公司的“寿命”中经历两个阶段:他们在某段时间内“存活”,然后变得永久不活跃。
“然后变得永久不活跃”这一点我们想要质疑。我目前的情况是,我们的品牌价值非常高,我们的产品单价在2到3美元左右(可以想象成糖果店)。根据我们的业务情境,我们有一个假设:“是的,我们的客户可能会不时有暂停期,但根据他们的行为,他们会再次回来,而且很可能永远不会流失”。
我之所以想要质疑这一点,是因为当我们尝试计算非常长时间(比如从计算CLV的时间算起的3年或5年)的频率的条件期望时,该模型将倾向于低估。根据在lifetimes.py包中提供的p_alive函数,经过适应我的RFM表后,我可以看到相当多的客户将在400到500天后流失。
我们想要修改这种行为,以便在“x”天后p_alive降低到某个“低值”(仍在头脑风暴中决定此值)之后,p_alive将延续到无穷大。
我理解这很可能会高估未来的访问次数,进一步高估CLV。然而,这是我尝试提出的一种方法,以弥补我们早期的问题。
如果我们修改先前的(伽马函数和一些指数函数),我应该如何修改这些分布函数,以使其适用于我的情况?
尝试过:为未来的每个月计算一个保留表,并将其设置为在足够低的值时的p_alive。
英文:
I am recently encountering a challenge with BTYD, specifically with Pareto-NBD model. See, from the papers that I read from Faders, there are few assumptions using this model, and the first and foremost is:
i) Customers go through two stages in their “lifetime” with a specific firm: they are “alive” for
some period of time, then become permanently inactive.
"then become permanently inactive" we want to challenge that. I am working in a situation where, its brand value is very high and our unit price for our products are like in 2~3 dollar ranges (think of a candy shop). We have an assumption from our business setting, that; "Yes, our customer may have a pause period every now and then, however, from their behaviors, they will come back again and most likely will never churn out".
The reason why I want to challenge that is because the model will tend to underestimate when we try to compute conditional expectation of frequency for a very long time (say.. 3 years or 5 years from point where we compute CLV). From the p_alive function provided in lifetimes.py package, after fitting to my RFM table, I could see that quite a chunk of customers will churn out after 400~500 days.
We want to modify this behavior so that the p_alive goes to some "low value" (still brainstorming to decide on this value) after "x" period of days and after that, that p_alive prolongs to infinity.
I understand that this will highly likely overshoot the future visits and furthermore overshoot CLV. However, this is sort of some method that I tried to come up to compensate our earlier question.
If we modify the prior, (Gamma function and some exponential function), how should I approach modifying these distribution functions so that it can be suitable in my situation?
Tried: calculating a retention table for every months in the future and setting that as p_alive when it sufficiently goes to low value
答案1
得分: 0
也许这个注释可以让您更好地了解帕累托/NBD模型:https://brucehardie.com/notes/031/
您是在尝试验证帕累托/NBD模型,就像Fader等人在他们的论文中所做的那样吗?我建议在深入研究问题之前,先从那里开始。我不明白您如何证明帕累托/NBD低估了CLV。
如果它确实低估了,您可以尝试一个不将第二阶段定义为“死亡”的模型。一个很好的例子是:https://faculty.wharton.upenn.edu/wp-content/uploads/2009/08/Schweidel_Fader_IJRM_2009.pdf(不幸的是,没有可用的软件包)。
英文:
Perhaps this note gives you a better insight into the Pareto/NBD model: https://brucehardie.com/notes/031/
Are you trying to validate the Pareto/NBD the way Fader et al. are doing it in their paper? I would start there before over-complicating your problem. I don't see how you have shown the Pareto/NBD is under-estimating the CLV.
If it does, you can try a model where the second stage is not 'death'. A good example is: https://faculty.wharton.upenn.edu/wp-content/uploads/2009/08/Schweidel_Fader_IJRM_2009.pdf (Sadly, no package available).
通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库,让每个人都能够通过互相帮助和分享经验来进步。
评论