PyCaret是一个开源、低代码的Python机器学习库,可自动执行机器学习工作流。它是一种端到端的机器学习和模型管理工具,可以以指数方式加快实验周期并提高您的工作效率。
与其他开源机器学习库相比,PyCaret是一个替代的低代码库,可用于仅用几行代码替换数百行代码。这使得实验速度和效率呈指数级增长。喜欢欢迎关注、收藏、点赞。
PyCaret本质上是围绕多个机器学习库和框架(例如 scikit-learn、XGBoost、LightGBM、CatBoost、spaCy、Optuna、Hyperopt、Ray 等的Python包装器。
PyCaret时间序列模块
PyCaret的新时间序列模块现已提供测试版。秉承 PyCaret 的简单性,它与现有的 API 保持一致,并带有很多功能:模型训练和选择(30多种算法)、模型分析、自动超参数调优、实验记录、云部署等。
pip install pycaret-ts-alpha
使用样例
PyCaret的时间序列模块中的工作流程非常简单。它从setup您定义预测范围,然后设置使用函数训练和评估多种算法。
加载数据
import pandas as pd
from pycaret.datasets import get_data
data = get_data('pycaret_downloads')
data['Date'] = pd.to_datetime(data['Date'])
data = data.groupby('Date').sum()
data = data.asfreq('D')
data.head()
data.plot()
data:image/s3,"s3://crabby-images/d91b1/d91b1d358a7029471b2a587a5d417b37319c2c8a" alt="图片"
初始化设置
from pycaret.time_series import *
setup(data, fh = 7, fold = 3, session_id = 123)
from pycaret.internal.pycaret_experiment import TimeSeriesExperiment
exp = TimeSeriesExperiment()
exp.setup(data, fh = 7, fold = 3, session_id = 123)
data:image/s3,"s3://crabby-images/9b31c/9b31ce5f25c1346f0c6118f81be4881e56913537" alt="图片"
统计测试
check_stats()
data:image/s3,"s3://crabby-images/18bc5/18bc5d6cc70d71aca53c4dd280ce6e49acd96239" alt="图片"
探索性数据分析
plot_model(plot = 'ts')
exp.plot_model(plot = 'ts')
data:image/s3,"s3://crabby-images/c815d/c815dbfe9a7f9d5e3941a4a95521dbc61177fddb" alt="图片"
plot_model(plot = 'cv')
data:image/s3,"s3://crabby-images/52d48/52d48ff34443b43517b6af926a909390b6bd14f3" alt="图片"
plot_model(plot = 'acf')
data:image/s3,"s3://crabby-images/b2549/b25495f4ced6c0ae4e0baec38f459be130699234" alt="图片"
plot_model(plot = 'diagnostics')
data:image/s3,"s3://crabby-images/469c8/469c8b2e42720b750be9dfc79b67c3829fa809a0" alt="图片"
plot_model(plot = 'decomp_stl')
data:image/s3,"s3://crabby-images/44c5d/44c5daeedb171339573c39788a2c92b9c627d6e3" alt="图片"
模型训练与选择
best = compare_models()
best = exp.compare_models()
data:image/s3,"s3://crabby-images/ff689/ff689cc2668882ef1bd4b947eeb6cbaa017bcf78" alt="图片"
prophet = create_model('prophet')
print(prophet)
data:image/s3,"s3://crabby-images/30cda/30cda6ff7a7dd2673ecde544a8a370606e095652" alt="图片"
tuned_prophet = tune_model(prophet)
print(tuned_prophet)
data:image/s3,"s3://crabby-images/ee015/ee01528442889d0ba76af2372330659e8ec614bc" alt="图片"
plot_model(best, plot = 'forecast')
data:image/s3,"s3://crabby-images/d6822/d68229e137689c954fbdd4c70664cdfe3e12bcfb" alt="图片"
plot_model(best, plot = 'forecast', data_kwargs = {'fh' : 30})
data:image/s3,"s3://crabby-images/31042/31042a179349bb8b3b2f41b370d309102f47fdaf" alt="图片"
plot_model(best, plot = 'insample')
data:image/s3,"s3://crabby-images/a9e3e/a9e3e06fbf3a38bb028df5e8a1ffdc337c944205" alt="图片"
plot_model(best, plot = 'residuals')
data:image/s3,"s3://crabby-images/119a9/119a9621576119bb000d83cdab625b386a1ee700" alt="图片"
测试部署
final_best = finalize_model(best)
predict_model(final_best, fh = 90)
data:image/s3,"s3://crabby-images/f6cd1/f6cd1f2c8991951493a23fe6e31201eac9d08580" alt="图片"
save_model(final_best, 'my_best_model')
data:image/s3,"s3://crabby-images/34a15/34a15f1ebd1a65ba75c2cfca13f66c546c8cf948" alt="图片"
参考资料
pycaret时序文档:https://pycaret.readthedocs.io/en/time_series/api/time_series.html
pycaret时序规划:https://github.com/pycaret/pycaret/issues/1648
|