Kaggle数据挖掘竞赛冠军及优胜者代码

2023-12-28 12:32

本文主要是介绍Kaggle数据挖掘竞赛冠军及优胜者代码,希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!

原文:http://www.chioka.in/kaggle-competition-solutions/

We learn more from code, and from great code. Not necessarily always the 1st ranking solution, because we also learn what makes a stellar and just a good solution. I will post solutions I came upon so we can all learn to become better!

I collected the following source code and interesting discussions from the Kaggle held competitions for learning purposes. Not all competitions are listed because I am only manually collecting them, also some competitions are not listed due to no one sharing. I will add more as time goes by. Thank you.

Algorithmic Trading Challenge

  • Solution whitepaper.
  • Solution thread.

Allstate Purchase Prediction Challenge

  • Rank 2 solution code by Alessandro Mariani.
  • Rank 10 solution code by B1aine.
  • Rank 36 solution code by Hiroyuki.
  • Rank 159 solution code by MrCanard.
  • Solution thread.

Amazon.com – Employee Access Challenge

  • Rank 1 solution code by Paul Duan and Benjamin Solecki team.
  • Rank 1 solution Q&A by Paul Duan.
  • Rank 2 solution code by Owen Zhang.
  • Rank 3 solution code by Dmitry & Leustagos.
  • Rank 289 solution code by Foxtrot with original blog post here.
  • Solution thread.

AMS 2013-2014 Solar Energy Prediction Contest

  • Rank 1 solution code and description by Leustagos team.
  • Rank 2 solution code and description by Toulouse.
  • Rank 3 solution code and description by Owen Zhang.
  • Rank 4 solution escription by Peter Prettenhofer.
  • Rank 5 solution description by Domcastro.
  • Rank 58 solution code and description by Davit.
  • Solution thread here.
  • Ridge Regression starter code with MAE about 2.2M by Alec Radford, original thread here.
  • Improved starter code by Foxtrot.
  • Baseline code with MAE about 2.6M using Catmull-Rom Spline interpolation, also available in R here and here.

Belkin Energy Disaggregation Competition

  • Solution thread.

Challenges in Representation Learning: Facial Expression Recognition Challenge

  • Rank 1 solution code and description by Charlie Tang.
  • Rank 3 solution description by Maxim Milakov.
  • Solution thread.

Challenges in Representation Learning: The Black Box Learning Challenge

  • Rank 1 solution description by David Thaler.
  • Rank 2 solution code and description by sayit.

Challenges in Representation Learning: Multi-modal Learning

  • Rank 1 solution by MMDL.
  • Solution thread.

Detecting Insults in Social Commentary

  • Rank 1 solution description and code by Vivek Sharma.
  • Rank 2 solution by tuzzeg.
  • Rank 3 solution description Andrei Olariu.
  • Rank 4 solution by Chris Brew.
  • Rank 5 solution description by Yasser Tabandeh.
  • Rank 6 solution by Andreas Mueller, code available here.
  • Rank 8 solution description by Steve Poulson.
  • Solution thread.

EMI Music Data Science Hackathon

  • Rank 4 solution description by Steffen Rindle.
  • Rank 18 solution code and description by Vlad Gusev.
  • Rank 34 solution code and description by zenog.
  • Solution thread.

Galaxy Zoo – The Galaxy Challenge

  • Rank 1 solution code and description by Sander Dieleman.
  • Rank 2 solution code and description by Maxim Milakov.
  • Rank 3 solution code and description by tund.
  • Rank 5 solution code and description by Julian de Wit.
  • Rank 9 solution code and description by Soumith Chintala.
  • Rank 13 solution code and description by Xiaoxiang Zhang.
  • Rank 28 solution code and description by utdiscant.
  • Rank 38 solution code and description by sugi.
  • Rank 57 solution code and description by hxu.
  • Rank 58 solution code and description by yr.
  • Solution thread.

Global Energy Forecasting Competition 2012 – Wind Forecasting

  • Rank 1 solution by Leustagos.
  • Solution thread here.

KDD Cup 2013 – Author-Paper Identification Challenge (Track 1)

  • Rank 1 solution with code and description by Team Algorithm, Github link to code here.

KDD Cup 2013 – Author Disambiguation Challenge (Track 2)

  • Rank 1 solution with code and description by Team Algorithm, Github link to code here.
  • Rank 2 solution by SmallData Team.
  • Rank 3 solution by hustmonk.
  • Rank 4 solution by Ben S.
  • Solution thread.

Large Scale Hierarchical Text Classification

  • Rank 1 solution code and description by anttip.
  • Rank 3 solution code and description by nagadomi.
  • Solution thread one.
  • Solution thread two.

Loan Default Prediction – Imperial College London

  • Rank 2 solution and description by HelloWorld.
  • Rank 12 solution and description by David McGarry.
  • Solution thread.

Merck Molecular Activity Challenge

  • Ideas sharing discussion thread.
  • Preprocessing techniques discussion thread.

MLSP 2013 Bird Classification Challenge

  • Rank 1 solution code and description by beluga.
  • Rank 2 solution code and description by Herbal Candy (W and thomeou).
  • Rank 3 solution description by Anil Thomas.
  • Rank 4 solution description by Maxim Milakov.
  • Solution thread.

Observing the Dark World

  • Rank 2 solution by Iain Murray, code available here.

PAKDD 2014 – ASUS Malfunctional Components Prediction

  • Yet another solution thread.
  • Solution thread.

Personalize Expedia Hotel Searches – ICDM 2013

  • Presentation paper/slides for ICDM 2013.
  • Solution thread.

Predicting a Biological Response

  • Rank 6 solution by Shea Parkes & Neil Schneider team.
  • Rank 17 solution of Ensemble of RandomForests, GradientBoostingTrees and ExtraTreesRegressor by Emanuele Olivetti.
    • Another solution code by Oblique Random Forest (oRF) by Shea Parkes & Neil Schneider team.
    • The code of my best submission thread. Talks about Multi-core training Oblique Random Forests, and Stacking.
    • Question about the process of ensemble learning thread. Talks about applying ensembles in practice, and how can problems arise and how to deal with them.

    Predicting Closed Questions on Stack Overflow

    • Rank 10 solution by Marco Lui.
    • Rank 33 solution by Foxtrot.

    See Click Predict Fix

    • Solution thread.

    See Click Predict Fix – Hackathon

    • Solution thread.

    StumbleUpon Evergreen Classification Challenge

    • Benchmark beater 1.
    • Benchmark beater 2.
    • Benchmark beater 3.
    • Solution thread.
    • My own solution, which is a good example of what is overfitting. (Public rank: 57, Private rank: 291)

    The Analytics Edge (15.071x)

    • Rank 17 solution code and description by Foxtrot.
    • Solution thread.

    The Marinexplore and Cornell University Whale Detection Challenge

    • Rank 1 solution by Nick Kridler.
    • Rank 7 solution by Gilles Louppe and Peter Prettenhofer team.
    • Rank 8 solution by Sander Dieleman.
    • Rank 56 solution by Sudeep Juvekar.
    • Solution discussion thread.
    • Mean spectogram thread.
    • Official interview from the Marinexplorer and Cornell at Kaggle.

    Walmart Recruiting – Store Sales Forecasting

    • Rank 1 solution code and description by David Thaler.
    • Rank 2 solution description by sriok.
    • Rank 3 solution code and description by James King.
    • Rank 5 solution description by ACS69.
    • Rank 6 solution description by T. Henry.
    • Rank 8 solution description by BreakfastPirate.
    • Rank 9 solution description by Neil Summers.
    • Rank 10 solution description by Gilberto Titericz Junior.
    • Rank 11 solution description by citynight.
    • Rank 16 solution code and description by yr.
    • Rank 29 solution code and description by Mike Kim.
    • Rank 30 solution description by dkay.
    • Solution thread.

    Thank you Foxtrot, James Petterson, Ben S for providing some of the links and solutions above.

这篇关于Kaggle数据挖掘竞赛冠军及优胜者代码的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!



http://www.chinasem.cn/article/545944

相关文章

使用Python和PaddleOCR实现图文识别的代码和步骤

《使用Python和PaddleOCR实现图文识别的代码和步骤》在当今数字化时代,图文识别技术的应用越来越广泛,如文档数字化、信息提取等,PaddleOCR是百度开源的一款强大的OCR工具包,它集成了... 目录一、引言二、环境准备2.1 安装 python2.2 安装 PaddlePaddle2.3 安装

SpringBoot中四种AOP实战应用场景及代码实现

《SpringBoot中四种AOP实战应用场景及代码实现》面向切面编程(AOP)是Spring框架的核心功能之一,它通过预编译和运行期动态代理实现程序功能的统一维护,在SpringBoot应用中,AO... 目录引言场景一:日志记录与性能监控业务需求实现方案使用示例扩展:MDC实现请求跟踪场景二:权限控制与

利用Python调试串口的示例代码

《利用Python调试串口的示例代码》在嵌入式开发、物联网设备调试过程中,串口通信是最基础的调试手段本文将带你用Python+ttkbootstrap打造一款高颜值、多功能的串口调试助手,需要的可以了... 目录概述:为什么需要专业的串口调试工具项目架构设计1.1 技术栈选型1.2 关键类说明1.3 线程模

Python Transformers库(NLP处理库)案例代码讲解

《PythonTransformers库(NLP处理库)案例代码讲解》本文介绍transformers库的全面讲解,包含基础知识、高级用法、案例代码及学习路径,内容经过组织,适合不同阶段的学习者,对... 目录一、基础知识1. Transformers 库简介2. 安装与环境配置3. 快速上手示例二、核心模

Java的栈与队列实现代码解析

《Java的栈与队列实现代码解析》栈是常见的线性数据结构,栈的特点是以先进后出的形式,后进先出,先进后出,分为栈底和栈顶,栈应用于内存的分配,表达式求值,存储临时的数据和方法的调用等,本文给大家介绍J... 目录栈的概念(Stack)栈的实现代码队列(Queue)模拟实现队列(双链表实现)循环队列(循环数组

使用Java将DOCX文档解析为Markdown文档的代码实现

《使用Java将DOCX文档解析为Markdown文档的代码实现》在现代文档处理中,Markdown(MD)因其简洁的语法和良好的可读性,逐渐成为开发者、技术写作者和内容创作者的首选格式,然而,许多文... 目录引言1. 工具和库介绍2. 安装依赖库3. 使用Apache POI解析DOCX文档4. 将解析

C++使用printf语句实现进制转换的示例代码

《C++使用printf语句实现进制转换的示例代码》在C语言中,printf函数可以直接实现部分进制转换功能,通过格式说明符(formatspecifier)快速输出不同进制的数值,下面给大家分享C+... 目录一、printf 原生支持的进制转换1. 十进制、八进制、十六进制转换2. 显示进制前缀3. 指

使用Python实现全能手机虚拟键盘的示例代码

《使用Python实现全能手机虚拟键盘的示例代码》在数字化办公时代,你是否遇到过这样的场景:会议室投影电脑突然键盘失灵、躺在沙发上想远程控制书房电脑、或者需要给长辈远程协助操作?今天我要分享的Pyth... 目录一、项目概述:不止于键盘的远程控制方案1.1 创新价值1.2 技术栈全景二、需求实现步骤一、需求

Java中Date、LocalDate、LocalDateTime、LocalTime、时间戳之间的相互转换代码

《Java中Date、LocalDate、LocalDateTime、LocalTime、时间戳之间的相互转换代码》:本文主要介绍Java中日期时间转换的多种方法,包括将Date转换为LocalD... 目录一、Date转LocalDateTime二、Date转LocalDate三、LocalDateTim

jupyter代码块没有运行图标的解决方案

《jupyter代码块没有运行图标的解决方案》:本文主要介绍jupyter代码块没有运行图标的解决方案,具有很好的参考价值,希望对大家有所帮助,如有错误或未考虑完全的地方,望不吝赐教... 目录jupyter代码块没有运行图标的解决1.找到Jupyter notebook的系统配置文件2.这时候一般会搜索到