Skip to content

SeonminKim1/AI-Research

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI_Research

AI Research & Paper Review

CNN

Classification Paper

  • (1998) LeNet – Gradient-based Learning Applied to Document Recognition
  • (2012) AlexNet – ImageNet Classification with Deep Convolutional Neural Network
  • (2014) GooLeNet – Going Deeper with Convolutions
  • (2015) VggNet – Very Deep Convolutional Networks for Large-Scale Image Recognition
  • (2015) SppNet – Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
  • (2015) ResNet – Deep Residual Learning for Image Recognition
    • (2016) ResNet 후속 - Identity Mapping in Deep Residual Networks
  • (2016) SqueezeNet - AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size
  • (2017) Xception – Xception: Deep Learning with Depthwise Separable Convolutions
  • (2017) MobileNet – MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Application
    • (2019) MobileNet V2-Inverted Residuals and Linear Bottlenecks
    • (2019) MobileNet V3-Searching for MobileNetV3
  • (2017) ShuffleNet – ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
    • (2018) ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design
  • (2018) DenseNet – Densely Connected Convolutional Networks
  • (2018) NasNet – Learning Transferable Architectures for Scalable Image Recognition
  • (2018) Bag of Tricks – Bag of Tricks for Image Classification with Convolutional Neural Networks
  • (2019) SeNet – Squeeze and Excitation Networks
  • (2020) EfficientNet-Rethinking Model Scaling for Convolutional Neural Networks
    • **(2021) EfficientNet v2 - Smaller Models and Faster Training

Object Detection

  • (2014) R-CNN - Rich feature hierarchies for accurate object detection and semantic segmentation
  • (2015) Fast R-CNN - Fast R-CNN
  • (2016) Faster R-CNN - Towards Real-Time Object Detection with Region Proposal Networks
  • (2016) FPN - Feature Pyramid Networks for Object Detection
  • (2016) SSD - Single Shot MultiBox Detector
  • (2016) YOLO V1 - You Only Look Once-Unified, Real-Time Object Detection
  • (2016) YOLO V2, 9000 - YOLO9000:Better,Faster,Stronger
  • (2018) YOLO V3 - YOLOv3: An Incremental Improvement
  • (2018) Retina Net - Focal Loss for Dense Object Detection
  • (2020) YOLO V4 Optimal Speed and Accuracy of Object Detection

Segmentation

  • (2015) FCN - Fully Convolutional Networks for Semantic Segmentation
  • (2015) SegNet - A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
  • (2015) U-Net - U-Net:Convolutional Networks for Biomedical Image Segmentation
  • (2015) Conv_deconv - Learning Deconvolution Network for Semantic Segmentation
  • (2016) Dilated_Conv - Multi Scale Context Aggregation by Dilated Convolutions
  • (2016) PSPNet - Pyramid Scene Parsing Network
  • (2016) RefineNet - Multi Path Refinement Networks for High-Resolution Semantic Segmentation
  • (2016) DeepLab V1 - Semantic Image Segmentation with deep convolutional nets and fully connected crfs
  • (2017) DeepLab V2 - DeepLab-Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
  • (2017) DeepLab V3 - Rethinking Atrous Convolution for Semantic Image Segmentation
  • (2018) DeepLab V3+ - Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
  • (2018) Mast R-CNN - Mask R-CNN

OCR (Optimcal Character Recognition)

  • (2016) TextBoxes-A Fast Text Detector with a Single Deep Neural Network
  • (2017) EAST-An Efficient and Accurate Scene Text Detector
  • (2018) CRNN - An end-to-end TextSpotter with Explicit Alignment and Attention
  • (2018) FOTS - Fast Oriented Text Spotting with a Unified Network
  • (2019) CRAFT - Character Region Awareness for Text Detection
  • (2019) What Is Wrong With Scene Text Recognition Model Comparisons Dataset and Model Analysis

GAN

  • (2014)_GAN - Generative Adversarial Nets
  • (2014)_CGAN - Conditional Generative Adversarial Nets
  • (2015)_DCGAN - Unsupervised Representation Learning with Deep Convolutional
  • (2016)_f-GAN - Training Generative Neural Samplers using Variational Divergence Minimization
  • (2016)_InfoGAN - Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
  • (2016)_LSGAN - Least Squares Generative Adversarial Networks
  • (2017)_CAN - Creative Adversarial Networks, Generating Art by Learning About Styles and Deviating from Style Norms
  • (2017)_CycleGAN_v7 - Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
  • (2017)_DiscoGAN - Learning to Discover Cross-Domain Relations with Generative Adversarial Networks
  • (2017)_EGAN - Energy-based Generative Adversarial Network
  • (2017)_WGAN - Wassertein GAN
  • (2017)_WGAN-gp - Improved Training of Wasserstein GANs
  • (2018)_StarGAN - Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
  • (2018)_StyleGAN - A Style-Based Generator Architecture for Generative Adversarial Networks
  • (2020)_StarGAN_v2 - Diverse Image Synthesis for Multiple Domains
  • (2020)_StyleGAN_v2 - Analyzing and Improving the Image Quality of StyleGAN

Video Action Recognition

  • (2014) C3D - Learning Spatiotemporal Features with 3D Convolutional Networks
  • (2014) LRCN - Long-term Recurrent Convolutional Networks for Visual Recognition and Description
  • (2014) Two Stream - Two-Stream Convolutional Networks for Action Recognition in Videos
  • (2016) LTC - Long-term Temporal Convolutionsfor Action Recognition
  • (2017) 3D ResNet - Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks
  • (2017) I3D - Quo Vadis, Action Recognition. A New Model and the Kinetics Dataset
  • (2017) R(2+1)D - A Closer Look at Spatiotemporal Convolutions for Action Recognition
  • (2018) SlowFast - SlowFast Networks for Video Recognition
  • (2019) irCSN152 - Large-scale weakly-supervised pre-training for video action recognition
  • (2020) LateTemporarl3DCNN - Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition

XAI (Explaniable AI)

  • (2015) CAM - Learning Deep Features for Discriminative Localization
  • (2019) Grad_CAM - Visual Explanations from Deep Networks via Gradient-based Localization
  • (2020) Score_CAM - Score Weighted Visual Explanations for Convolutional Neural Networks

Knowledge Distillation

  • (2015) Distilling the knowledge in a Neural Network
  • (2019) Improved Knowledge Distillation via Teacher Assistant

Quantization

  • (2019) Post-Training 4-bit Quantization of convolutional networks for rapid deployment Paper (NeurIPS)
  • (2019) Post-Training 4-bit Quantization on Embedding
  • (2019) µLayer - Low Latency On-Device Inference Using Cooperative Single-Layer Acceleration and Processor-Friendly Quantization
  • (2020) Improving Post Training Neural Quantization
  • (2020) EasyQuant - Post-training Quantization via Scale Optimization
  • (2021) Post_Training weighted quantization of neural networks for language_models

Releases

No releases published

Packages

No packages published