引用本文: | 宋述芳,何入洋.基于随机森林的重要性测度指标体系.[J].国防科技大学学报,2021,43(2):25-32.[点击复制] |
SONG Shufang,HE Ruyang.Importance measure index system based on random forest[J].Journal of National University of Defense Technology,2021,43(2):25-32[点击复制] |
|
|
|
本文已被:浏览 6761次 下载 6275次 |
基于随机森林的重要性测度指标体系 |
宋述芳,何入洋 |
(西北工业大学 航空学院, 陕西 西安 710072)
|
摘要: |
重要性测度分析可以找出重要特征变量,从而降低输入空间的维数,节约运算成本。基于随机森林重要性测度的分析原理,探寻随机森林的重要性测度指标与基于方差的全局灵敏度指标之间的联系,得到求解方差灵敏度主指标Si及其总指标STi的新途径。建立基于随机森林的单变量、组变量重要性测度指标,并明确具体的求解过程,完善基于随机森林的重要性测度指标体系。通过算例验证了所提基于随机森林的重要性测度指标体系的有效性及其与方差灵敏度指标之间关系的正确性。 |
关键词: 随机森林 重要性测度 全局灵敏度 组变量 降维 |
DOI:10.11887/j.cn.202102004 |
投稿日期:2019-09-16 |
基金项目:国家数值风洞工程资助项目(NNW 2019ZT2-A05);国家自然科学基金资助项目(11902254) |
|
Importance measure index system based on random forest |
SONG Shufang, HE Ruyang |
(School of Aeronautics, Northwestern Polytechnical University, Xi′an 710072, China)
|
Abstract: |
The importance measure analysis can find out the important feature variables of model, which can effectively reduce the variable dimension and decrease the computation time. The relationship between the important measure of random forest and the variance-based global sensitivity measure was explored, which can give a novel way to solve variance-based main sensitivity index Si and total sensitivity index STi. The importance measure of single and group variables based on random forest were established to improve the corresponding measure index system. Several examples are given to verify the validity of the proposed important measures and the correctness relation derivation about variance-based sensitivity indices. |
Keywords: random forest importance measure global sensitivity group variables dimension-reduction |
|
|