Scalable Supervised Dimensionality Reduction Using Clustering

被引:0
|
作者
Raeder, Troy [1 ]
Perlich, Claudia [1 ]
Dalessandro, Brian [1 ]
Stitelman, Ori [1 ]
Provost, Foster [2 ,3 ]
机构
[1] m6d Res, 37 E 18th St, New York, NY 10003 USA
[2] NYU, New York, NY 10012 USA
[3] m6d Res, New York, NY 10012 USA
关键词
supervised dimensionality reduction; clustering; ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The automated targeting of online display ads at scale requires the simultaneous evaluation of a single prospect against many independent models. When deciding which ad to show to a user, one must calculate likelihood-to-convert scores for that user across all potential advertisers in the system. For modern machine-learning-based targeting, as conducted by Media6Degrees (m6d), this can mean scoring against thousands of models in a large, sparse feature space. Dimensionality reduction within this space is useful, as it decreases scoring time and model storage requirements. To meet this need, we develop a novel algorithm for scalable supervised dimensionality reduction across hundreds of simultaneous classification tasks. The algorithm performs hierarchical clustering in the space of model parameters from historical models in order to collapse related features into a single dimension. This allows us to implicitly incorporate feature and label data across all tasks without operating directly in a massive space. We present experimental results showing that for this task our algorithm outperforms other popular dimensionality-reduction algorithms across a wide variety of ad campaigns, as well as production results that showcase its performance in practice.
引用
收藏
页码:1213 / 1221
页数:9
相关论文
共 50 条
  • [1] A scalable supervised algorithm for dimensionality reduction on streaming data
    Yan, Jun
    Zhang, Benyu
    Yan, Shuicheng
    Liu, Ning
    Yang, Qiang
    Cheng, Qiansheng
    Li, Hua
    Chen, Zheng
    Ma, Wei-Ying
    INFORMATION SCIENCES, 2006, 176 (14) : 2042 - 2065
  • [2] Supervised and Unsupervised Clustering Based Dimensionality Reduction of Hyperspectral Data
    Beirami, B. A.
    Mokhtarzade, M.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2021, 34 (06): : 1407 - 1412
  • [3] A Framework for Semi-Supervised Clustering Based on Dimensionality Reduction
    Cui Peng
    Zhang Ru-bo
    FIRST INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, : 192 - +
  • [4] Patent Document Clustering Using Dimensionality Reduction
    Girthana, K.
    Swamynathan, S.
    PROGRESS IN ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, VOL 2, 2018, 564 : 167 - 176
  • [5] Bayesian Supervised Dimensionality Reduction
    Gonen, Mehmet
    IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (06) : 2179 - 2189
  • [6] Arabic text clustering using improved clustering algorithms with dimensionality reduction
    Arun Kumar Sangaiah
    Ahmed E. Fakhry
    Mohamed Abdel-Basset
    Ibrahim El-henawy
    Cluster Computing, 2019, 22 : 4535 - 4549
  • [7] Arabic text clustering using improved clustering algorithms with dimensionality reduction
    Sangaiah, Arun Kumar
    Fakhry, Ahmed E.
    Abdel-Basset, Mohamed
    El-henawy, Ibrahim
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (02): : S4535 - S4549
  • [8] Semi-supervised dimensionality reduction using orthogonal projection divergence-based clustering for hyperspectral imagery
    Su, Hongjun
    Du, Peijun
    Du, Qian
    OPTICAL ENGINEERING, 2012, 51 (11)
  • [9] Supervised Dimensionality Reduction and Visualization using Centroid-Encoder
    Ghosh, Tomojit
    Kirby, Michael
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23 : 1 - 34
  • [10] Supervised Dimensionality Reduction and Visualization using Centroid-Encoder
    Ghosh, Tomojit
    Kirby, Michael
    Journal of Machine Learning Research, 2022, 23