高级检索

基于混合采样和随机森林的重过载配变识别

Research on Heavy Overload Identification of Distribution Transformer Based on Hybrid Sampling and Random Forest

  • 摘要: 受节假日、人员返乡、季节等特殊时段和配网规划、运行因素的影响,配电变压器重过载较难准确预测。重过载不仅损害变压器的寿命,还会导致设备烧毁,影响供电稳定性和可靠性。针对电网实际数据采集过程中正常运行和重过载配电变压器样本比例不平衡问题导致机器学习算法无法有效辨识重过载样本,提出一种基于混合采样和随机森林的重过载配变辨识方法。首先,对正常运行样本进行聚类分层欠采样,对重过载配变样本进行SMOTETomek采样,缩小样本分布差距。其次,使用采样后的数据集训练随机森林模型。实验结果证明,混合采样能提高分类算法的性能。所提方法可为电网运维人员识别重过载配变提供支持。

     

    Abstract: Affected by special periods such as holidays, people returning home, seasons, and distribution network planning and operation factors, distribution transformers are prone to heavy overload. The existence of heavy overload will not only damage the life of the transformer, but also cause the equipment to burn out, affecting the stability and reliability of power supply. However, in the actual collection process, the number of heavy overloads of distribution transformers is relatively small, resulting in the inability of the classifier to effectively identify abnormal samples. Therefore, a distribution transformer heavy overload identification method based on mixed sampling and random forest is proposed. Clustering and stratified under-sampling are performed on normal samples, and SMOTETomek sampling is performed on heavy overload samples to narrow the sample distribution gap. The sampled data set is then used to train the random forest model. Experimental results show that the proposed sampling method can improve the performance of the classifier to a certain extent.

     

/

返回文章
返回