高级检索

基于混合采样和随机森林的重过载配变识别

Research on Heavy Overload Identification of Distribution Transformer Based on Hybrid Sampling and Random Forest

  • 摘要: 受节假日、人员返乡、季节等特殊时段和配网规划、运行因素的影响,配电变压器重过载较难准确预测。针对电网实际数据采集过程中正常运行和重过载配电变压器样本比例不平衡问题导致机器学习算法无法有效辨识重过载样本,提出一种基于混合采样和随机森林的重过载配变辨识方法。首先,对正常运行样本进行聚类分层欠采样,对重过载配变样本进行SMOTETomek采样,缩小样本分布差距。其次,使用采样后的数据集训练随机森林模型。实验结果证明,混合采样能提高分类算法的性能。

     

    Abstract: Affected by special periods such as holidays, people returning home, seasons, and distribution network planning and operation factors, distribution transformers are prone to heavy overload. However, in the actual collection process, the number of heavy overloads of distribution transformers is relatively small, resulting in the inability of the classifier to effectively identify abnormal samples. Therefore, a distribution transformer heavy overload identification method based on mixed sampling and random forest is proposed. Clustering and stratified under-sampling are performed on normal samples, and SMOTETomek sampling is performed on heavy overload samples to narrow the sample distribution gap. The sampled data set is then used to train the random forest model.

     

/

返回文章
返回