Code from Kaggle Data Mining Competition Champions and Top Finishers

2023-12-28 12:32

Original article: http://www.chioka.in/kaggle-competition-solutions/

We learn more from code, and most of all from great code. That does not always mean the first-place solution, because we also learn from seeing what separates a stellar solution from a merely good one. I will post the solutions I come across so we can all learn to become better!

I collected the following source code and interesting discussions from competitions held on Kaggle, for learning purposes. Not every competition is listed: I collect the links manually, and for some competitions no one shared a solution. I will add more as time goes by. Thank you.

Algorithmic Trading Challenge

  • Solution whitepaper.
  • Solution thread.

Allstate Purchase Prediction Challenge

  • Rank 2 solution code by Alessandro Mariani.
  • Rank 10 solution code by B1aine.
  • Rank 36 solution code by Hiroyuki.
  • Rank 159 solution code by MrCanard.
  • Solution thread.

Amazon.com – Employee Access Challenge

  • Rank 1 solution code by Paul Duan and Benjamin Solecki team.
  • Rank 1 solution Q&A by Paul Duan.
  • Rank 2 solution code by Owen Zhang.
  • Rank 3 solution code by Dmitry & Leustagos.
  • Rank 289 solution code by Foxtrot with original blog post here.
  • Solution thread.

AMS 2013-2014 Solar Energy Prediction Contest

  • Rank 1 solution code and description by Leustagos team.
  • Rank 2 solution code and description by Toulouse.
  • Rank 3 solution code and description by Owen Zhang.
  • Rank 4 solution description by Peter Prettenhofer.
  • Rank 5 solution description by Domcastro.
  • Rank 58 solution code and description by Davit.
  • Solution thread here.
  • Ridge Regression starter code with MAE about 2.2M by Alec Radford, original thread here.
  • Improved starter code by Foxtrot.
  • Baseline code with MAE about 2.6M using Catmull-Rom Spline interpolation, also available in R here and here.
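
The starter entries above are compared by MAE (mean absolute error), and the baseline mentions Catmull-Rom spline interpolation. As a rough illustration only, and not the code behind any of the linked entries, the two ideas can be sketched in plain Python. The function names `catmull_rom` and `mean_absolute_error`, the choice of the uniform Catmull-Rom form, and all sample numbers are assumptions made for this example.

```python
def catmull_rom(p0, p1, p2, p3, t):
    """Uniform Catmull-Rom interpolation between p1 and p2 for t in [0, 1].

    p0 and p3 are the neighbouring control points that shape the curve;
    the result equals p1 at t=0 and p2 at t=1.
    """
    return 0.5 * (
        2.0 * p1
        + (-p0 + p2) * t
        + (2.0 * p0 - 5.0 * p1 + 4.0 * p2 - p3) * t ** 2
        + (-p0 + 3.0 * p1 - 3.0 * p2 + p3) * t ** 3
    )


def mean_absolute_error(actual, predicted):
    """Average of the absolute differences between two equal-length sequences."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical values, chosen only to exercise the two functions.
grid = [0.0, 1.0, 2.0, 3.0]           # four evenly spaced control values
midpoint = catmull_rom(*grid, 0.5)    # value halfway between grid[1] and grid[2]
error = mean_absolute_error([10.0, 20.0, 30.0], [12.0, 18.0, 33.0])
```

In a forecasting setting like this one, an interpolated value would stand in for a prediction at a location or time between known grid points, and MAE would score those predictions against the measured values.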

Belkin Energy Disaggregation Competition

  • Solution thread.

Challenges in Representation Learning: Facial Expression Recognition Challenge

  • Rank 1 solution code and description by Charlie Tang.
  • Rank 3 solution description by Maxim Milakov.
  • Solution thread.

Challenges in Representation Learning: The Black Box Learning Challenge

  • Rank 1 solution description by David Thaler.
  • Rank 2 solution code and description by sayit.

Challenges in Representation Learning: Multi-modal Learning

  • Rank 1 solution by MMDL.
  • Solution thread.

Detecting Insults in Social Commentary

  • Rank 1 solution description and code by Vivek Sharma.
  • Rank 2 solution by tuzzeg.
  • Rank 3 solution description by Andrei Olariu.
  • Rank 4 solution by Chris Brew.
  • Rank 5 solution description by Yasser Tabandeh.
  • Rank 6 solution by Andreas Mueller, code available here.
  • Rank 8 solution description by Steve Poulson.
  • Solution thread.

EMI Music Data Science Hackathon

  • Rank 4 solution description by Steffen Rindle.
  • Rank 18 solution code and description by Vlad Gusev.
  • Rank 34 solution code and description by zenog.
  • Solution thread.

Galaxy Zoo – The Galaxy Challenge

  • Rank 1 solution code and description by Sander Dieleman.
  • Rank 2 solution code and description by Maxim Milakov.
  • Rank 3 solution code and description by tund.
  • Rank 5 solution code and description by Julian de Wit.
  • Rank 9 solution code and description by Soumith Chintala.
  • Rank 13 solution code and description by Xiaoxiang Zhang.
  • Rank 28 solution code and description by utdiscant.
  • Rank 38 solution code and description by sugi.
  • Rank 57 solution code and description by hxu.
  • Rank 58 solution code and description by yr.
  • Solution thread.

Global Energy Forecasting Competition 2012 – Wind Forecasting

  • Rank 1 solution by Leustagos.
  • Solution thread here.

KDD Cup 2013 – Author-Paper Identification Challenge (Track 1)

  • Rank 1 solution with code and description by Team Algorithm, GitHub link to code here.

KDD Cup 2013 – Author Disambiguation Challenge (Track 2)

  • Rank 1 solution with code and description by Team Algorithm, GitHub link to code here.
  • Rank 2 solution by SmallData Team.
  • Rank 3 solution by hustmonk.
  • Rank 4 solution by Ben S.
  • Solution thread.

Large Scale Hierarchical Text Classification

  • Rank 1 solution code and description by anttip.
  • Rank 3 solution code and description by nagadomi.
  • Solution thread one.
  • Solution thread two.

Loan Default Prediction – Imperial College London

  • Rank 2 solution and description by HelloWorld.
  • Rank 12 solution and description by David McGarry.
  • Solution thread.

Merck Molecular Activity Challenge

  • Ideas sharing discussion thread.
  • Preprocessing techniques discussion thread.

MLSP 2013 Bird Classification Challenge

  • Rank 1 solution code and description by beluga.
  • Rank 2 solution code and description by Herbal Candy (W and thomeou).
  • Rank 3 solution description by Anil Thomas.
  • Rank 4 solution description by Maxim Milakov.
  • Solution thread.

Observing the Dark World

  • Rank 2 solution by Iain Murray, code available here.

PAKDD 2014 – ASUS Malfunctional Components Prediction

  • Yet another solution thread.
  • Solution thread.

Personalize Expedia Hotel Searches – ICDM 2013

  • Presentation paper/slides for ICDM 2013.
  • Solution thread.

Predicting a Biological Response

  • Rank 6 solution by Shea Parkes & Neil Schneider team.
  • Rank 17 solution, an ensemble of RandomForests, GradientBoostingTrees and ExtraTreesRegressor, by Emanuele Olivetti.
  • Another solution, with code, using Oblique Random Forests (oRF) by the Shea Parkes & Neil Schneider team.
  • "The code of my best submission" thread, which discusses multi-core training of Oblique Random Forests and stacking.
  • "Question about the process of ensemble learning" thread, which discusses applying ensembles in practice, how problems can arise, and how to deal with them.

Predicting Closed Questions on Stack Overflow

  • Rank 10 solution by Marco Lui.
  • Rank 33 solution by Foxtrot.

See Click Predict Fix

  • Solution thread.

See Click Predict Fix – Hackathon

  • Solution thread.

StumbleUpon Evergreen Classification Challenge

  • Benchmark beater 1.
  • Benchmark beater 2.
  • Benchmark beater 3.
  • Solution thread.
  • My own solution, which is a good example of overfitting. (Public rank: 57, Private rank: 291)

The Analytics Edge (15.071x)

  • Rank 17 solution code and description by Foxtrot.
  • Solution thread.

The Marinexplore and Cornell University Whale Detection Challenge

  • Rank 1 solution by Nick Kridler.
  • Rank 7 solution by Gilles Louppe and Peter Prettenhofer team.
  • Rank 8 solution by Sander Dieleman.
  • Rank 56 solution by Sudeep Juvekar.
  • Solution discussion thread.
  • Mean spectrogram thread.
  • Official interview from Marinexplore and Cornell at Kaggle.

Walmart Recruiting – Store Sales Forecasting

  • Rank 1 solution code and description by David Thaler.
  • Rank 2 solution description by sriok.
  • Rank 3 solution code and description by James King.
  • Rank 5 solution description by ACS69.
  • Rank 6 solution description by T. Henry.
  • Rank 8 solution description by BreakfastPirate.
  • Rank 9 solution description by Neil Summers.
  • Rank 10 solution description by Gilberto Titericz Junior.
  • Rank 11 solution description by citynight.
  • Rank 16 solution code and description by yr.
  • Rank 29 solution code and description by Mike Kim.
  • Rank 30 solution description by dkay.
  • Solution thread.

Thank you, Foxtrot, James Petterson, and Ben S, for providing some of the links and solutions above.

