Deep Learning Papers by task

Papers about deep learning ordered by task, date. Current state-of-the-art papers and papers useful for getting started are labelled.

Object Recognition

YOLO9000: Better, Faster, Stronger, dec 2016, arxiv
Feature Pyramid Networks for Object Detection, dec 2016, arxiv
Densely Connected Convolutional Networks, dec 2016, arxiv
PVANet: Lightweight Deep Neural Networks for Real-time Object Detection, dec 2016, arxiv
Aggregated Residual Transformations for Deep Neural Networks, nov 2016, arxiv
Hierarchical Object Detection with Deep Reinforcement Learning, nov 2016, arxiv
Learning to Make Better Mistakes: Semantics-aware Visual Food Recognition, okt 2016, IBM, paper
T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos, aug 2016, github, arxiv
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning, aug 2016, Google, arxiv
Residual Networks of Residual Networks: Multilevel Residual Networks, aug 2016, arxiv
Context Matters: Refining Object Detection in Video with Recurrent Neural Networks, jul 2016, arxiv
R-FCN: Object Detection via Region-based Fully Convolutional Networks, jun 2016, arxiv
Training Region-based Object Detectors with Online Hard Example Mining, apr 2016, Facebook, arxiv
Deep Residual Learning for Image Recognition, dec 2015, arxiv
SSD: Single Shot MultiBox Detector, dec 2015, Google, github, arxiv
ParseNet: Looking Wider to See Better, jun 2015, arxiv
You Only Look Once: Unified, Real-Time Object Detection, jun 2015, Facebook, arxiv
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, jun 2015, Microsoft/Facebook arxiv
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, feb 2015, arxiv
Deep Image: Scaling up Image Recognition, jul 2015, arxiv
Rich feature hierarchies for accurate object detection and semantic segmentation, 2014, paper
Selective Search for Object Recognition, 2012, paper
ImageNet Classification with Deep Convolutional Neural Networks, 2012, paper

Image Segmentation

Mask R-CNN, mar 2017, arxiv
Learning Features by Watching Objects Move, dec 2016, Facebook, arxiv
Fully Convolutional Networks for Semantic Segmentation, may 2016, arxiv
Instance-aware Semantic Segmentation via Multi-task Network Cascades, dec 2015, arxiv
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation, dec 2015, arxiv
Learning Rich Features from RGB-D Images for Object Detection and Segmentation, jul 2014, arxiv

Pose Estimation

Fast Single Shot Detection and Pose Estimation, sep 2016, arxiv

Face Recognition

Accessorize to a Crime: Real and Stealthy Attacks on State-of-the-Art Face Recognition, paper
OpenFace: A general-purpose face recognition library with mobile applications, June 2016, paper
Deep Face Recognition, 2015, paper
Compact Convolutional Neural Network Cascade for Face Detection, aug 2015, arxiv
Learning Robust Deep Face Representation, Jul 2015, arxiv
FaceNet: A Unified Embedding for Face Recognition and Clustering, jun 2015, paper
Multi-view Face Detection Using Deep Convolutional Neural Networks, yahoo, feb 2015, arxiv

Style Transfer

Deep Photo Style Transfer, mar 2017, arxiv
A learned representation for artistic style, okt 2016, Google, arxiv, demo
Fast Style Transfer in TensorFlow, github
Instance Normalization: The Missing Ingredient for Fast Stylization, sept 2016, arxiv
A Neural Algorithm of Artistic Style, sept 2015, arxiv
Perceptual Losses for Real-Time Style Transfer and Super-Resolution, arxiv, github

Logo Recognition

Deep Learning Logo Detection with Data Expansion by Synthesising Context, jan 2016, arxiv
Automatic Graphic Logo Detection via Fast Region-based Convolutional Networks, apr 2016, arxiv
Logo Localization and Recognition in Natural Images Using Homographic Class Graphs, 2016, paper
LOGO-Net: Large-scale Deep Logo Detection and Brand Recognition with Deep Region-based Convolutional Networks, nov 2015, arxiv
DeepLogo: Hitting Logo Recognition with the Deep Neural Network Hammer, okt 2015, Berkely, arxiv
Automatic detection of logos in video and their removal using inpainting, jul 2015, paper
On the Benefit of Synthetic Data for Company Logo Detection, 2015, paper
Fast and Robust Realtime Storefront Logo Recognition, paper
Scalable Logo Recognition in Real-World Images, 2011, paper
https://arxiv.org/pdf/1609.01414v1.pdf

note: also includes some papers that use SIFT

Text (in the Wild) Recognition

COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images, jun 2016, arxiv
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild, mar 2016, arxiv
OCR Error Correction Using Character Correction and Feature-Based Word Classification, 2016, arxiv
Efficient Scene Text Localization and Recognition with Local Character Refinement, apr 2015, arxiv
Reading Text in the Wild with Convolutional Neural Networks, dec 2014, arxiv
Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition, jun 2014, arxiv
Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning, 2011, paper

Image / Video Description

Generation and Comprehension of Unambiguous Object Descriptions, apr 2016, arxiv
Long-term Recurrent Convolutional Networks for Visual Recognition and Description, may 2016, arxiv

Detect key actor

Detecting events and key actors in multi-person videos, mar 2015, arxiv

Self-driving cars

End to End Learning for Self-Driving Cars, apr 2016, NVIDIA, arxiv

ConvNet visualization

Visualizing and Understanding Convolutional Networks, nov 2013, arxiv

Object part detection

Discovering the physical parts of an articulated object class from multiple videos, 2016, paper

Pedestrian Detection

Joint Deep Learning for Pedestrian Detection, 2013, paper

Lip Reading

Lip Reading in the Wild, 2016, Oxford, paper

Super Resolution

RAISR: Rapid and Accurate Image Super Resolution, okt 2016, Google, arxiv
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network, sep 2016, Twitter, arxiv

Image Compression

Full Resolution Image Compression with Recurrent Neural Networks, aug 2016, Google arxiv

Automated Theorem Proving

DeepMath - Deep Sequence Models for Premise Selection, jun 2016, Google arxiv

Reverse Engineering

Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data, okt 2016, arxiv
Stealing Machine Learning Models via Prediction APIs, aug 2016, paper

Language

Rationalizing Neural Predictions github, arxiv

Translation

Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation, nov 2016, Google, arxiv
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation, okt 2016, Google arxiv

Age

Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker, dec 2016, arxiv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

task-papers.md

task-papers.md

Deep Learning Papers by task

Object Recognition

Image Segmentation

Pose Estimation

Face Recognition

Style Transfer

Logo Recognition

Text (in the Wild) Recognition

Image / Video Description

Detect key actor

Self-driving cars

ConvNet visualization

Object part detection

Pedestrian Detection

Lip Reading

Super Resolution

Image Compression

Automated Theorem Proving

Reverse Engineering

Language

Translation

Age

Files

task-papers.md

Latest commit

History

task-papers.md

File metadata and controls

Deep Learning Papers by task

Object Recognition

Image Segmentation

Pose Estimation

Face Recognition

Style Transfer

Logo Recognition

Text (in the Wild) Recognition

Image / Video Description

Detect key actor

Self-driving cars

ConvNet visualization

Object part detection

Pedestrian Detection

Lip Reading

Super Resolution

Image Compression

Automated Theorem Proving

Reverse Engineering

Language

Translation

Age