123,123

基于混合偏好分析的联邦推荐框架

网络安全与数据治理 10期

宋雨伦，李大中，张丝雨，庄媛，崔玲龙，王功举，闫龙

（联通数字科技有限公司数据智能事业部，北京100013）

摘要： 针对基于内容推荐算法存在过拟合及过度专业化问题，提出了一种混合个体及群体偏好分析的联邦推荐框架，该框架包含三个模块：基于随机森林的偏好分析模块、基于联邦的相似兴趣用户分群模块、基于混合偏好分析的推荐模块。在基于随机森林的偏好分析模块中，提出考虑组合因素的同一属性间偏好度分析和结合信息熵的不同属性间在意度分析方法,通过纠正与真实兴趣属性具有强关联性的非兴趣属性影响，初步解决了算法的过拟合问题；在基于联邦的相似兴趣用户分群模块和混合偏好分析的推荐模块中，通过基于联邦的相似兴趣用户分群模块和群体的同属性异属性分析，得到用户所在兴趣群体的同属性异属性偏好信息，并通过网格遍历群体和个体的超参权重混合进行资源推荐，解决了推荐系统存在的过度专业化问题，提高算法在特定任务上的表现。实验结果表明所提算法相较于随机森林算法在各方面指标上有明显提升，且提高了推荐系统的可解释性。

關(guān)鍵詞： 推荐算法偏好分析基于内容推荐联邦学习

中圖分類號：TP393
文獻(xiàn)標(biāo)識碼：A
DOI:10.19358/j.issn.2097-1788.2023.10.003
引用格式：宋雨倫，李大中，張絲雨，等.基于混合偏好分析的聯(lián)邦推薦框架［J］.網(wǎng)絡(luò)安全與數(shù)據(jù)治理，2023，42（10）：16-22.

Mixed preference analysis federated recommendation framework

Song Yulun, Li Dazhong, Zhang Siyu,Zhuang Yuan, Cui Linglong, Wang Gongju, Yan Long

(Data Intelligence Division， Unicom Digital Tech Co., Ltd., Beijing 100013, China)

Abstract： Targeting to solve the overfitting and overspecialization problems existing in contentbased recommendation algorithms, this paper proposed a Mixed Preference Analysis Federated Recommendation Framework (MPAFRF), which includes three modules: preference analysis module, federated interest grouping module and recommendation module with mixed preference analysis. In preference analysis module, a method to calculate the preference score of same and different attributes of a resource is proposed, mainly to solve the overfitting problem. The proposed framework overcames the overspecialization problem by first choosing a group, then assigning a preference score for each resource considering a mixed preference. And it uses grid search algorithm to find the best hyperparameter considering individual and group preference. The experimental results show that the proposed recommendation framework is enhanced in both performance and interpretability. The performance of the algorithm on specific task demonstrates that compared with the random forest algorithm, the framework proposed in this paper has a significant improvement in various indicators, and improves the interpretability of the recommendation system.

Key words : recommendation algorithm; preference analysis; contentbased recommendation; federated learning

0 引言

由于互聯(lián)網(wǎng)在規(guī)模和覆蓋等層面的高速發(fā)展，信息過載使得用戶找到對自己有用的信息變得困難，從而使得信息利用效率降低；同時讓商家無法找到真正的目標(biāo)群體［1］，且盲目擬合數(shù)據(jù)而不考慮固有偏差將導(dǎo)致許多嚴(yán)重問題，例如線下評估和線上指標(biāo)之間的差異，損害用戶的滿意度等［2］。所以，如何提高推薦算法的精確性、可解釋性、多樣性等以獲得信息更高的使用效率和用戶更多的信任都是推薦系統(tǒng)研究的重點(diǎn)。

近年來，我國高度重視數(shù)字經(jīng)濟(jì)高質(zhì)量發(fā)展，同時統(tǒng)籌發(fā)展與安全，相繼出臺了多部相關(guān)法律法規(guī)，保障網(wǎng)絡(luò)空間數(shù)據(jù)流通的安全及合規(guī)。聯(lián)邦學(xué)習(xí)技術(shù)有助于解決多方數(shù)據(jù)合作中面臨的兩大挑戰(zhàn)：一是數(shù)據(jù)安全難以得到保障，隱私數(shù)據(jù)泄露問題亟待解決；二是由于網(wǎng)絡(luò)安全隔離和行業(yè)隱私，不同行業(yè)、部門之間存在數(shù)據(jù)壁壘，導(dǎo)致數(shù)據(jù)形成“孤島”無法安全共享［3］。

本文在保障數(shù)據(jù)使用安全合規(guī)的前提下，結(jié)合聯(lián)邦建模技術(shù)，提出了一種混合個體及群體偏好分析的聯(lián)邦推薦框架。本文主要貢獻(xiàn)如下：

（1）本文提出考慮組合因素的同一屬性間偏好度分析和結(jié)合信息熵的不同屬性間在意度分析方法,通過糾正與真實(shí)興趣屬性具有強(qiáng)關(guān)聯(lián)性的非興趣屬性影響，初步解決了算法的過擬合問題，增強(qiáng)了算法的可解釋性。

（2）本文提出了一種混合偏好分析的聯(lián)邦推薦框架（Mixed Preference Analysis Federated Recommendation Framework, MPAFRF），通過基于聯(lián)邦的相似興趣用戶分群模塊找到用戶所在興趣群，結(jié)合個體和群體超參權(quán)重組合、個體偏好分?jǐn)?shù)集合、個體所在群體偏好分?jǐn)?shù)集合進(jìn)行資源推薦，緩解了推薦系統(tǒng)存在的過度專業(yè)化問題，提高了算法在特定任務(wù)上的表現(xiàn)。

本文詳細(xì)內(nèi)容請下載：http://m.ihrv.cn/resource/share/2000005733

作者信息：

宋雨倫，李大中，張絲雨，莊媛，崔玲龍，王功舉，閆龍

（聯(lián)通數(shù)字科技有限公司數(shù)據(jù)智能事業(yè)部，北京100013）

微信圖片_20210517164139.jpg

原創(chuàng)聲明：此內(nèi)容為AET網(wǎng)站原創(chuàng)，未經(jīng)授權(quán)禁止轉(zhuǎn)載。

相關(guān)內(nèi)容