AI Research & Paper Review
- (1998) LeNet – Gradient-based Learning Applied to Document Recognition
- (2012) AlexNet – ImageNet Classification with Deep Convolutional Neural Network
- (2014) GooLeNet – Going Deeper with Convolutions
- (2015) VggNet – Very Deep Convolutional Networks for Large-Scale Image Recognition
- (2015) SppNet – Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
- (2015) ResNet – Deep Residual Learning for Image Recognition
- (2016) ResNet 후속 - Identity Mapping in Deep Residual Networks
- (2016) SqueezeNet - AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size
- (2017) Xception – Xception: Deep Learning with Depthwise Separable Convolutions
- (2017) MobileNet – MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Application
- (2019) MobileNet V2-Inverted Residuals and Linear Bottlenecks
- (2019) MobileNet V3-Searching for MobileNetV3
- (2017) ShuffleNet – ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
- (2018) ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design
- (2018) DenseNet – Densely Connected Convolutional Networks
- (2018) NasNet – Learning Transferable Architectures for Scalable Image Recognition
- (2018) Bag of Tricks – Bag of Tricks for Image Classification with Convolutional Neural Networks
- (2019) SeNet – Squeeze and Excitation Networks
- (2020) EfficientNet-Rethinking Model Scaling for Convolutional Neural Networks
- **(2021) EfficientNet v2 - Smaller Models and Faster Training
- (2014) R-CNN - Rich feature hierarchies for accurate object detection and semantic segmentation
- (2015) Fast R-CNN - Fast R-CNN
- (2016) Faster R-CNN - Towards Real-Time Object Detection with Region Proposal Networks
- (2016) FPN - Feature Pyramid Networks for Object Detection
- (2016) SSD - Single Shot MultiBox Detector
- (2016) YOLO V1 - You Only Look Once-Unified, Real-Time Object Detection
- (2016) YOLO V2, 9000 - YOLO9000:Better,Faster,Stronger
- (2018) YOLO V3 - YOLOv3: An Incremental Improvement
- (2018) Retina Net - Focal Loss for Dense Object Detection
- (2020) YOLO V4 Optimal Speed and Accuracy of Object Detection
- (2015) FCN - Fully Convolutional Networks for Semantic Segmentation
- (2015) SegNet - A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
- (2015) U-Net - U-Net:Convolutional Networks for Biomedical Image Segmentation
- (2015) Conv_deconv - Learning Deconvolution Network for Semantic Segmentation
- (2016) Dilated_Conv - Multi Scale Context Aggregation by Dilated Convolutions
- (2016) PSPNet - Pyramid Scene Parsing Network
- (2016) RefineNet - Multi Path Refinement Networks for High-Resolution Semantic Segmentation
- (2016) DeepLab V1 - Semantic Image Segmentation with deep convolutional nets and fully connected crfs
- (2017) DeepLab V2 - DeepLab-Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
- (2017) DeepLab V3 - Rethinking Atrous Convolution for Semantic Image Segmentation
- (2018) DeepLab V3+ - Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
- (2018) Mast R-CNN - Mask R-CNN
- (2016) TextBoxes-A Fast Text Detector with a Single Deep Neural Network
- (2017) EAST-An Efficient and Accurate Scene Text Detector
- (2018) CRNN - An end-to-end TextSpotter with Explicit Alignment and Attention
- (2018) FOTS - Fast Oriented Text Spotting with a Unified Network
- (2019) CRAFT - Character Region Awareness for Text Detection
- (2019) What Is Wrong With Scene Text Recognition Model Comparisons Dataset and Model Analysis
- (2014)_GAN - Generative Adversarial Nets
- (2014)_CGAN - Conditional Generative Adversarial Nets
- (2015)_DCGAN - Unsupervised Representation Learning with Deep Convolutional
- (2016)_f-GAN - Training Generative Neural Samplers using Variational Divergence Minimization
- (2016)_InfoGAN - Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
- (2016)_LSGAN - Least Squares Generative Adversarial Networks
- (2017)_CAN - Creative Adversarial Networks, Generating Art by Learning About Styles and Deviating from Style Norms
- (2017)_CycleGAN_v7 - Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
- (2017)_DiscoGAN - Learning to Discover Cross-Domain Relations with Generative Adversarial Networks
- (2017)_EGAN - Energy-based Generative Adversarial Network
- (2017)_WGAN - Wassertein GAN
- (2017)_WGAN-gp - Improved Training of Wasserstein GANs
- (2018)_StarGAN - Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
- (2018)_StyleGAN - A Style-Based Generator Architecture for Generative Adversarial Networks
- (2020)_StarGAN_v2 - Diverse Image Synthesis for Multiple Domains
- (2020)_StyleGAN_v2 - Analyzing and Improving the Image Quality of StyleGAN
- (2014) C3D - Learning Spatiotemporal Features with 3D Convolutional Networks
- (2014) LRCN - Long-term Recurrent Convolutional Networks for Visual Recognition and Description
- (2014) Two Stream - Two-Stream Convolutional Networks for Action Recognition in Videos
- (2016) LTC - Long-term Temporal Convolutionsfor Action Recognition
- (2017) 3D ResNet - Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks
- (2017) I3D - Quo Vadis, Action Recognition. A New Model and the Kinetics Dataset
- (2017) R(2+1)D - A Closer Look at Spatiotemporal Convolutions for Action Recognition
- (2018) SlowFast - SlowFast Networks for Video Recognition
- (2019) irCSN152 - Large-scale weakly-supervised pre-training for video action recognition
- (2020) LateTemporarl3DCNN - Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition
- (2015) CAM - Learning Deep Features for Discriminative Localization
- (2019) Grad_CAM - Visual Explanations from Deep Networks via Gradient-based Localization
- (2020) Score_CAM - Score Weighted Visual Explanations for Convolutional Neural Networks
- (2015) Distilling the knowledge in a Neural Network
- (2019) Improved Knowledge Distillation via Teacher Assistant
- (2019) Post-Training 4-bit Quantization of convolutional networks for rapid deployment Paper (NeurIPS)
- (2019) Post-Training 4-bit Quantization on Embedding
- (2019) µLayer - Low Latency On-Device Inference Using Cooperative Single-Layer Acceleration and Processor-Friendly Quantization
- (2020) Improving Post Training Neural Quantization
- (2020) EasyQuant - Post-training Quantization via Scale Optimization
- (2021) Post_Training weighted quantization of neural networks for language_models