机器学习每周挑战——基于时间序列的商店销售数据预测

Dataset Description

In this competition, you will predict sales for the thousands of product families sold at Favorita stores located in Ecuador. The training data includes dates, store and product information, whether that item was being promoted, as well as the sales numbers. Additional files include supplementary information that may be useful in building your models.

File Descriptions and Data Field Information

train.csv

  • The training data, comprising time series of features store_nbr , family , and onpromotion as well as the target sales.
  • store_nbr identifies the store at which the products are sold.
  • family identifies the type of product sold.
  • sales gives the total sales for a product family at a particular store at a given date. Fractional values are possible since products can be sold in fractional units (1.5 kg of cheese, for instance, as opposed to 1 bag of chips).
  • onpromotion gives the total number of items in a product family that were being promoted at a store at a given date.

test.csv

  • The test data, having the same features as the training data. You will predict the target sales for the dates in this file.
  • The dates in the test data are for the 15 days after the last date in the training data.

sample_submission.csv

  • A sample submission file in the correct format.

stores.csv

  • Store metadata, including city , state , type , and cluster.
  • cluster is a grouping of similar stores.

oil.csv

  • Daily oil price. Includes values during both the train and test data timeframes. (Ecuador is an oil-dependent country and it's economical health is highly vulnerable to shocks in oil prices.)

holidays_events.csv

  • Holidays and Events, with metadata
  • NOTE: Pay special attention to the transferred column. A holiday that is transferred officially falls on that calendar day, but was moved to another date by the government. A transferred day is more like a normal day than a holiday. To find the day that it was actually celebrated, look for the corresponding row where type is Transfer. For example, the holiday Independencia de Guayaquil was transferred from 2012-10-09 to 2012-10-12, which means it was celebrated on 2012-10-12. Days that are type Bridge are extra days that are added to a holiday (e.g., to extend the break across a long weekend). These are frequently made up by the type Work Day which is a day not normally scheduled for work (e.g., Saturday) that is meant to payback the Bridge.
  • Additional holidays are days added a regular calendar holiday, for example, as typically happens around Christmas (making Christmas Eve a holiday).

Additional Notes

  • Wages in the public sector are paid every two weeks on the 15 th and on the last day of the month. Supermarket sales could be affected by this.
  • A magnitude 7.8 earthquake struck Ecuador on April 16, 2016. People rallied in relief efforts donating water and other first need products which greatly affected supermarket sales for several weeks after the earthquake.

这里的代码是kaggle中一位大佬的代码,这里我只是看懂了代码所表达的意思,如果各位想学习一下,可以私信我要源码,或者去kaggle上找这篇原作,非常厉害的一位大佬。由于代码太多,且环境是jupyter notebook,代码块也非常多,复制粘贴太麻烦。因此我这里使用截图。

相关推荐
小南家的青蛙3 分钟前
LeetCode第2658题 - 网格图中鱼的最大数目
算法·leetcode·职场和发展
AIGC科技8 分钟前
焕新而来,境由AI生|AIRender升级更名“渲境AI”,重新定义设计渲染效率
人工智能·深度学习·图形渲染
出来吧皮卡丘12 分钟前
A2UI:让 AI Agent 自主构建用户界面的新范式
前端·人工智能·aigc
nju_spy16 分钟前
深度强化学习 TRPO 置信域策略优化实验(sb3_contrib / 手搓 + CartPole-v1 / Breakout-v5)
人工智能·强化学习·共轭梯度法·策略网络·trpo·sb3_contrib·breakout游戏
ZHang......18 分钟前
LeetCode 1114. 按序打印
java·开发语言·算法
程序员欣宸19 分钟前
LangChain4j实战之四:集成到spring-boot
java·人工智能·spring boot
cmdyu_20 分钟前
告别 LLM 输出的不确定性:深度解析 TypeChat 如何重塑 AI 工程化开发
人工智能
想你依然心痛20 分钟前
AI赋能编程语言挑战赛:从Python到Rust,我用AI大模型重塑开发效率
人工智能·python·rust
测试人社区-千羽22 分钟前
AR/VR应用测试核心要点与实施策略
人工智能·安全·职场和发展·自动驾驶·测试用例·ar·vr
人工智能技术咨询.30 分钟前
DNN案例一步步构建深层神经网络
人工智能·神经网络