Kaggle数据挖掘竞赛冠军及优胜者代码

2023-12-28 12:32

本文主要是介绍Kaggle数据挖掘竞赛冠军及优胜者代码,希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!

原文:http://www.chioka.in/kaggle-competition-solutions/

We learn more from code, and from great code. Not necessarily always the 1st ranking solution, because we also learn what makes a stellar and just a good solution. I will post solutions I came upon so we can all learn to become better!

I collected the following source code and interesting discussions from the Kaggle held competitions for learning purposes. Not all competitions are listed because I am only manually collecting them, also some competitions are not listed due to no one sharing. I will add more as time goes by. Thank you.

Algorithmic Trading Challenge

  • Solution whitepaper.
  • Solution thread.

Allstate Purchase Prediction Challenge

  • Rank 2 solution code by Alessandro Mariani.
  • Rank 10 solution code by B1aine.
  • Rank 36 solution code by Hiroyuki.
  • Rank 159 solution code by MrCanard.
  • Solution thread.

Amazon.com – Employee Access Challenge

  • Rank 1 solution code by Paul Duan and Benjamin Solecki team.
  • Rank 1 solution Q&A by Paul Duan.
  • Rank 2 solution code by Owen Zhang.
  • Rank 3 solution code by Dmitry & Leustagos.
  • Rank 289 solution code by Foxtrot with original blog post here.
  • Solution thread.

AMS 2013-2014 Solar Energy Prediction Contest

  • Rank 1 solution code and description by Leustagos team.
  • Rank 2 solution code and description by Toulouse.
  • Rank 3 solution code and description by Owen Zhang.
  • Rank 4 solution escription by Peter Prettenhofer.
  • Rank 5 solution description by Domcastro.
  • Rank 58 solution code and description by Davit.
  • Solution thread here.
  • Ridge Regression starter code with MAE about 2.2M by Alec Radford, original thread here.
  • Improved starter code by Foxtrot.
  • Baseline code with MAE about 2.6M using Catmull-Rom Spline interpolation, also available in R here and here.

Belkin Energy Disaggregation Competition

  • Solution thread.

Challenges in Representation Learning: Facial Expression Recognition Challenge

  • Rank 1 solution code and description by Charlie Tang.
  • Rank 3 solution description by Maxim Milakov.
  • Solution thread.

Challenges in Representation Learning: The Black Box Learning Challenge

  • Rank 1 solution description by David Thaler.
  • Rank 2 solution code and description by sayit.

Challenges in Representation Learning: Multi-modal Learning

  • Rank 1 solution by MMDL.
  • Solution thread.

Detecting Insults in Social Commentary

  • Rank 1 solution description and code by Vivek Sharma.
  • Rank 2 solution by tuzzeg.
  • Rank 3 solution description Andrei Olariu.
  • Rank 4 solution by Chris Brew.
  • Rank 5 solution description by Yasser Tabandeh.
  • Rank 6 solution by Andreas Mueller, code available here.
  • Rank 8 solution description by Steve Poulson.
  • Solution thread.

EMI Music Data Science Hackathon

  • Rank 4 solution description by Steffen Rindle.
  • Rank 18 solution code and description by Vlad Gusev.
  • Rank 34 solution code and description by zenog.
  • Solution thread.

Galaxy Zoo – The Galaxy Challenge

  • Rank 1 solution code and description by Sander Dieleman.
  • Rank 2 solution code and description by Maxim Milakov.
  • Rank 3 solution code and description by tund.
  • Rank 5 solution code and description by Julian de Wit.
  • Rank 9 solution code and description by Soumith Chintala.
  • Rank 13 solution code and description by Xiaoxiang Zhang.
  • Rank 28 solution code and description by utdiscant.
  • Rank 38 solution code and description by sugi.
  • Rank 57 solution code and description by hxu.
  • Rank 58 solution code and description by yr.
  • Solution thread.

Global Energy Forecasting Competition 2012 – Wind Forecasting

  • Rank 1 solution by Leustagos.
  • Solution thread here.

KDD Cup 2013 – Author-Paper Identification Challenge (Track 1)

  • Rank 1 solution with code and description by Team Algorithm, Github link to code here.

KDD Cup 2013 – Author Disambiguation Challenge (Track 2)

  • Rank 1 solution with code and description by Team Algorithm, Github link to code here.
  • Rank 2 solution by SmallData Team.
  • Rank 3 solution by hustmonk.
  • Rank 4 solution by Ben S.
  • Solution thread.

Large Scale Hierarchical Text Classification

  • Rank 1 solution code and description by anttip.
  • Rank 3 solution code and description by nagadomi.
  • Solution thread one.
  • Solution thread two.

Loan Default Prediction – Imperial College London

  • Rank 2 solution and description by HelloWorld.
  • Rank 12 solution and description by David McGarry.
  • Solution thread.

Merck Molecular Activity Challenge

  • Ideas sharing discussion thread.
  • Preprocessing techniques discussion thread.

MLSP 2013 Bird Classification Challenge

  • Rank 1 solution code and description by beluga.
  • Rank 2 solution code and description by Herbal Candy (W and thomeou).
  • Rank 3 solution description by Anil Thomas.
  • Rank 4 solution description by Maxim Milakov.
  • Solution thread.

Observing the Dark World

  • Rank 2 solution by Iain Murray, code available here.

PAKDD 2014 – ASUS Malfunctional Components Prediction

  • Yet another solution thread.
  • Solution thread.

Personalize Expedia Hotel Searches – ICDM 2013

  • Presentation paper/slides for ICDM 2013.
  • Solution thread.

Predicting a Biological Response

  • Rank 6 solution by Shea Parkes & Neil Schneider team.
  • Rank 17 solution of Ensemble of RandomForests, GradientBoostingTrees and ExtraTreesRegressor by Emanuele Olivetti.
    • Another solution code by Oblique Random Forest (oRF) by Shea Parkes & Neil Schneider team.
    • The code of my best submission thread. Talks about Multi-core training Oblique Random Forests, and Stacking.
    • Question about the process of ensemble learning thread. Talks about applying ensembles in practice, and how can problems arise and how to deal with them.

    Predicting Closed Questions on Stack Overflow

    • Rank 10 solution by Marco Lui.
    • Rank 33 solution by Foxtrot.

    See Click Predict Fix

    • Solution thread.

    See Click Predict Fix – Hackathon

    • Solution thread.

    StumbleUpon Evergreen Classification Challenge

    • Benchmark beater 1.
    • Benchmark beater 2.
    • Benchmark beater 3.
    • Solution thread.
    • My own solution, which is a good example of what is overfitting. (Public rank: 57, Private rank: 291)

    The Analytics Edge (15.071x)

    • Rank 17 solution code and description by Foxtrot.
    • Solution thread.

    The Marinexplore and Cornell University Whale Detection Challenge

    • Rank 1 solution by Nick Kridler.
    • Rank 7 solution by Gilles Louppe and Peter Prettenhofer team.
    • Rank 8 solution by Sander Dieleman.
    • Rank 56 solution by Sudeep Juvekar.
    • Solution discussion thread.
    • Mean spectogram thread.
    • Official interview from the Marinexplorer and Cornell at Kaggle.

    Walmart Recruiting – Store Sales Forecasting

    • Rank 1 solution code and description by David Thaler.
    • Rank 2 solution description by sriok.
    • Rank 3 solution code and description by James King.
    • Rank 5 solution description by ACS69.
    • Rank 6 solution description by T. Henry.
    • Rank 8 solution description by BreakfastPirate.
    • Rank 9 solution description by Neil Summers.
    • Rank 10 solution description by Gilberto Titericz Junior.
    • Rank 11 solution description by citynight.
    • Rank 16 solution code and description by yr.
    • Rank 29 solution code and description by Mike Kim.
    • Rank 30 solution description by dkay.
    • Solution thread.

    Thank you Foxtrot, James Petterson, Ben S for providing some of the links and solutions above.

这篇关于Kaggle数据挖掘竞赛冠军及优胜者代码的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!



http://www.chinasem.cn/article/545944

相关文章

SpringBoot+RustFS 实现文件切片极速上传的实例代码

《SpringBoot+RustFS实现文件切片极速上传的实例代码》本文介绍利用SpringBoot和RustFS构建高性能文件切片上传系统,实现大文件秒传、断点续传和分片上传等功能,具有一定的参考... 目录一、为什么选择 RustFS + SpringBoot?二、环境准备与部署2.1 安装 RustF

Python实现Excel批量样式修改器(附完整代码)

《Python实现Excel批量样式修改器(附完整代码)》这篇文章主要为大家详细介绍了如何使用Python实现一个Excel批量样式修改器,文中的示例代码讲解详细,感兴趣的小伙伴可以跟随小编一起学习一... 目录前言功能特性核心功能界面特性系统要求安装说明使用指南基本操作流程高级功能技术实现核心技术栈关键函

Redis实现高效内存管理的示例代码

《Redis实现高效内存管理的示例代码》Redis内存管理是其核心功能之一,为了高效地利用内存,Redis采用了多种技术和策略,如优化的数据结构、内存分配策略、内存回收、数据压缩等,下面就来详细的介绍... 目录1. 内存分配策略jemalloc 的使用2. 数据压缩和编码ziplist示例代码3. 优化的

Python 基于http.server模块实现简单http服务的代码举例

《Python基于http.server模块实现简单http服务的代码举例》Pythonhttp.server模块通过继承BaseHTTPRequestHandler处理HTTP请求,使用Threa... 目录测试环境代码实现相关介绍模块简介类及相关函数简介参考链接测试环境win11专业版python

Python从Word文档中提取图片并生成PPT的操作代码

《Python从Word文档中提取图片并生成PPT的操作代码》在日常办公场景中,我们经常需要从Word文档中提取图片,并将这些图片整理到PowerPoint幻灯片中,手动完成这一任务既耗时又容易出错,... 目录引言背景与需求解决方案概述代码解析代码核心逻辑说明总结引言在日常办公场景中,我们经常需要从 W

使用Spring Cache本地缓存示例代码

《使用SpringCache本地缓存示例代码》缓存是提高应用程序性能的重要手段,通过将频繁访问的数据存储在内存中,可以减少数据库访问次数,从而加速数据读取,:本文主要介绍使用SpringCac... 目录一、Spring Cache简介核心特点:二、基础配置1. 添加依赖2. 启用缓存3. 缓存配置方案方案

MySQL的配置文件详解及实例代码

《MySQL的配置文件详解及实例代码》MySQL的配置文件是服务器运行的重要组成部分,用于设置服务器操作的各种参数,下面:本文主要介绍MySQL配置文件的相关资料,文中通过代码介绍的非常详细,需要... 目录前言一、配置文件结构1.[mysqld]2.[client]3.[mysql]4.[mysqldum

Python多线程实现大文件快速下载的代码实现

《Python多线程实现大文件快速下载的代码实现》在互联网时代,文件下载是日常操作之一,尤其是大文件,然而,网络条件不稳定或带宽有限时,下载速度会变得很慢,本文将介绍如何使用Python实现多线程下载... 目录引言一、多线程下载原理二、python实现多线程下载代码说明:三、实战案例四、注意事项五、总结引

IDEA与MyEclipse代码量统计方式

《IDEA与MyEclipse代码量统计方式》文章介绍在项目中不安装第三方工具统计代码行数的方法,分别说明MyEclipse通过正则搜索(排除空行和注释)及IDEA使用Statistic插件或调整搜索... 目录项目场景MyEclipse代码量统计IDEA代码量统计总结项目场景在项目中,有时候我们需要统计

MySQL设置密码复杂度策略的完整步骤(附代码示例)

《MySQL设置密码复杂度策略的完整步骤(附代码示例)》MySQL密码策略还可能包括密码复杂度的检查,如是否要求密码包含大写字母、小写字母、数字和特殊字符等,:本文主要介绍MySQL设置密码复杂度... 目录前言1. 使用 validate_password 插件1.1 启用 validate_passwo