SlideShare a Scribd company logo
Multi-class Image Classification using
Deep Convolutional Networks on
extremely large dataset
Marija Stanojevic
Ashis Chanda
CIS Department, Temple University
5th December 2017
Problem Definition
❖ Kaggle competition: Cdiscount’s Image Classification Challenge
❖ Classify user posted images into 5270 categories
❖ Challenges of dataset:
➢ Huge amount of data and categories
➢ Background clutter (objects blend into environment)
➢ Viewpoint and data scale variation
➢ Deformation
➢ Occlusion
➢ Illumination conditions
❖ Deep learning models seem suitable
Data sets Size # products
Train data 58.2 GB 7,069,896
Test data 14.5 GB 1,768,182
Category # Labels
Category1 49
Category2 483
Category3 5270
Dataset: Product complexity
Dataset: Categories
A sample list of product categories:
Our used models
● CNN
● DenseNet
● ResNet
● ResNext
● Wide-res Net
Models: CNN
❖ Our CNN
➢ 3 Conv, 2 FC layers, 3 Pool layers
➢ thinner layers
➢ downsampling three times
❖ Tried to use for three types of categories
❖ l th layer has connection with all previous layers (l -1)
❖ Normally, each dense block contains 40, 100, 160 layers
❖ We used 3 blocks, 3 layers. Total connection: l x (l -1) /2 = 15
❖ Each layer has BN, ReLU, Conv
❖ No transition layer (downsampling), but used dropout
Models: DenseNet
Models: ResNet
❖ 34, 50, 101, 152 layers
❖ Residual block example
❖ Batch normalization, ReLu
❖ Our residual network
➢ thinner layers (25 conv + 4 pool + fc)
➢ downsampling after each block
➢ no bottleneck
❖ Uses residual block as bases
❖ Block width is divided in k parts
which learn separately
❖ Blocks are wider than in resNet
❖ Distinct cardinalities
❖ 29, 50 and 101 layers
❖ Our resNext:
➢ thinner layers
➢ downsampling after each block
➢ less layers (19 conv + 4 pool + fc)
➢ cardinality 32 always - best
results according to paper
Models: ResNext
❖ Uses residual block as bases
❖ Block is k times wider (k=1, 2, 4, 8, 10, 12)
❖ Dropout beside batch normalization
❖ 16, 22, 28, 40 layers
❖ Our wideResNet:
➢ k = 2 - doesn’t increase much number
of parameters, but shows biggest
improvement in original results
➢ dropout keep = 0.7 - best in original results
➢ less layers (13 conv + 4 pool + fc)
Models: WideResNet
Challenges in dataset
❖ 5.6 TB after splitting data
❖ Train data (707 Batches) & Test data (177 Batches)
❖ Smaller batch: more reads and writes
❖ Bigger batch: memory error
❖ Each batch has 10,000 products ~ 20,000 images
❖ Cross-validation data: 707 products ~ 1500 images
Challenges in implementation
❖ Used owlsnesttwo high performance computing system
❖ GPU: NVIDIA Tesla P100 PCIe 12GB
❖ Only two jobs allowed in parallel
❖ Implemented with tensorflow and tflearn in python
❖ Network
➢ Complexity in debugging
➢ Bigger network: memory error => tuning network
➢ Thinner and less layers
➢ Small number of epoch
Baseline (CNN) Result
❖ CNN: (10 Epochs, 10 Batches)
❖ CNN: (50 Epochs, 50 Batches) (in progress)
Label # class Accuracy Error Time
Category 1 49 52% 10.7 4 hours
Category 2 483 47% 11.68 5 hours
Category 3 5270 32% 4.89 11 hours
Category 1 49 12%
(1 epoch)
3.42 20 hours
Experimental Result (category 3)
Model # Batch # Epoch Accuracy Error Time
DenseNet 50/707 1 (running) 4% 9.83 10 days
ResNet 50/707 10 34% 3.94489 5.69 hours
ResNext 50/707 10 28.9% 4.51031 16.8 hours
WideResNext 50/707 10 41.93% 3.53318 6.11 hours
ResNet 707/707 3 (running) 36.69% 3.84427 3.35 days
ResNext 707/707 1 (running) 28.47% 4.50394 9.90 days
WideResNext 707/707 2 (running) 40.51% 3.49804 3.60 days
We are here now
Experimental results: Error in classifying
Mobile cover bag Mobile Case Mobile
Laptop cover bag Mobile film protector Mobile Case
Conclusions and Future Work
❖ Baseline: CNN; Proposed: resNet, resNext, denseNet, wideResNet
❖ All proposed networks have similar number of parameters
❖ wideResNet performs the best
❖ resNext gave worst results and is 3 times slower than resNet
❖ DenseNet requires GPU with huge memory
❖ Requires a lot of time, huge memory and
fast computational resources
❖ Number of epochs has to be 70+
❖ Future: Submit result to Kaggle competition
Thank you
Questions?
Ad

More Related Content

What's hot (20)

Batch normalization paper review
Batch normalization paper reviewBatch normalization paper review
Batch normalization paper review
Minho Heo
 
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
UMBC
 
Enhanced Deep Residual Networks for Single Image Super-Resolution
Enhanced Deep Residual Networks for Single Image Super-ResolutionEnhanced Deep Residual Networks for Single Image Super-Resolution
Enhanced Deep Residual Networks for Single Image Super-Resolution
NAVER Engineering
 
Pruning convolutional neural networks for resource efficient inference
Pruning convolutional neural networks for resource efficient inferencePruning convolutional neural networks for resource efficient inference
Pruning convolutional neural networks for resource efficient inference
Kaushalya Madhawa
 
Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)
SungminYou
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Jia-Bin Huang
 
Visualizaing and understanding convolutional networks
Visualizaing and understanding convolutional networksVisualizaing and understanding convolutional networks
Visualizaing and understanding convolutional networks
SungminYou
 
PR12-193 NISP: Pruning Networks using Neural Importance Score Propagation
PR12-193 NISP: Pruning Networks using Neural Importance Score PropagationPR12-193 NISP: Pruning Networks using Neural Importance Score Propagation
PR12-193 NISP: Pruning Networks using Neural Importance Score Propagation
Taesu Kim
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural network
Itachi SK
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNet
SungminYou
 
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Wanjin Yu
 
DeepFix: a fully convolutional neural network for predicting human fixations...
DeepFix:  a fully convolutional neural network for predicting human fixations...DeepFix:  a fully convolutional neural network for predicting human fixations...
DeepFix: a fully convolutional neural network for predicting human fixations...
Universitat Politècnica de Catalunya
 
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image SegmentationDeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
NamHyuk Ahn
 
[Chung il kim] 0829 thesis
[Chung il kim] 0829 thesis[Chung il kim] 0829 thesis
[Chung il kim] 0829 thesis
Chung-Il Kim
 
PR243: Designing Network Design Spaces
PR243: Designing Network Design SpacesPR243: Designing Network Design Spaces
PR243: Designing Network Design Spaces
Jinwon Lee
 
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
IOSR Journals
 
Convolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular ArchitecturesConvolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular Architectures
ananth
 
Understanding Convolutional Neural Networks
Understanding Convolutional Neural NetworksUnderstanding Convolutional Neural Networks
Understanding Convolutional Neural Networks
Jeremy Nixon
 
LeNet to ResNet
LeNet to ResNetLeNet to ResNet
LeNet to ResNet
Somnath Banerjee
 
CNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent AdvancesCNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent Advances
Dmytro Mishkin
 
Batch normalization paper review
Batch normalization paper reviewBatch normalization paper review
Batch normalization paper review
Minho Heo
 
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
UMBC
 
Enhanced Deep Residual Networks for Single Image Super-Resolution
Enhanced Deep Residual Networks for Single Image Super-ResolutionEnhanced Deep Residual Networks for Single Image Super-Resolution
Enhanced Deep Residual Networks for Single Image Super-Resolution
NAVER Engineering
 
Pruning convolutional neural networks for resource efficient inference
Pruning convolutional neural networks for resource efficient inferencePruning convolutional neural networks for resource efficient inference
Pruning convolutional neural networks for resource efficient inference
Kaushalya Madhawa
 
Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)
SungminYou
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Jia-Bin Huang
 
Visualizaing and understanding convolutional networks
Visualizaing and understanding convolutional networksVisualizaing and understanding convolutional networks
Visualizaing and understanding convolutional networks
SungminYou
 
PR12-193 NISP: Pruning Networks using Neural Importance Score Propagation
PR12-193 NISP: Pruning Networks using Neural Importance Score PropagationPR12-193 NISP: Pruning Networks using Neural Importance Score Propagation
PR12-193 NISP: Pruning Networks using Neural Importance Score Propagation
Taesu Kim
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural network
Itachi SK
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNet
SungminYou
 
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Wanjin Yu
 
DeepFix: a fully convolutional neural network for predicting human fixations...
DeepFix:  a fully convolutional neural network for predicting human fixations...DeepFix:  a fully convolutional neural network for predicting human fixations...
DeepFix: a fully convolutional neural network for predicting human fixations...
Universitat Politècnica de Catalunya
 
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image SegmentationDeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
NamHyuk Ahn
 
[Chung il kim] 0829 thesis
[Chung il kim] 0829 thesis[Chung il kim] 0829 thesis
[Chung il kim] 0829 thesis
Chung-Il Kim
 
PR243: Designing Network Design Spaces
PR243: Designing Network Design SpacesPR243: Designing Network Design Spaces
PR243: Designing Network Design Spaces
Jinwon Lee
 
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
IOSR Journals
 
Convolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular ArchitecturesConvolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular Architectures
ananth
 
Understanding Convolutional Neural Networks
Understanding Convolutional Neural NetworksUnderstanding Convolutional Neural Networks
Understanding Convolutional Neural Networks
Jeremy Nixon
 
CNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent AdvancesCNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent Advances
Dmytro Mishkin
 

Similar to Multi-class Image Classification using deep convolutional networks on extremely large dataset (20)

Lightweight DNN Processor Design (based on NVDLA)
Lightweight DNN Processor Design (based on NVDLA)Lightweight DNN Processor Design (based on NVDLA)
Lightweight DNN Processor Design (based on NVDLA)
Shien-Chun Luo
 
Deep Learning for Computer Vision: Memory usage and computational considerati...
Deep Learning for Computer Vision: Memory usage and computational considerati...Deep Learning for Computer Vision: Memory usage and computational considerati...
Deep Learning for Computer Vision: Memory usage and computational considerati...
Universitat Politècnica de Catalunya
 
Manycores for the Masses
Manycores for the MassesManycores for the Masses
Manycores for the Masses
Intel® Software
 
Introduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNetIntroduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNet
KrishnakoumarC
 
Once-for-All: Train One Network and Specialize it for Efficient Deployment
 Once-for-All: Train One Network and Specialize it for Efficient Deployment Once-for-All: Train One Network and Specialize it for Efficient Deployment
Once-for-All: Train One Network and Specialize it for Efficient Deployment
taeseon ryu
 
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
Codemotion
 
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...
e8xu
 
Distributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learnedDistributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learned
Wee Hyong Tok
 
Convolutional Neural Network Models - Deep Learning
Convolutional Neural Network Models - Deep LearningConvolutional Neural Network Models - Deep Learning
Convolutional Neural Network Models - Deep Learning
Mohamed Loey
 
Intelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiencyIntelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiency
Qualcomm Research
 
YOLO9000 - PR023
YOLO9000 - PR023YOLO9000 - PR023
YOLO9000 - PR023
Jinwon Lee
 
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC SystemsImproving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
HPCC Systems
 
CNN, Deep Learning ResNet_30_Slide_Presentation.pptx
CNN, Deep Learning ResNet_30_Slide_Presentation.pptxCNN, Deep Learning ResNet_30_Slide_Presentation.pptx
CNN, Deep Learning ResNet_30_Slide_Presentation.pptx
OnUrTipsIncorporatio
 
Chatbot
ChatbotChatbot
Chatbot
Liam Bui
 
Salt Identification Challenge
Salt Identification ChallengeSalt Identification Challenge
Salt Identification Challenge
kenluck2001
 
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methodsA brief introduction to recent segmentation methods
A brief introduction to recent segmentation methods
Shunta Saito
 
NVIDIA深度學習教育機構 (DLI): Object detection with jetson
NVIDIA深度學習教育機構 (DLI): Object detection with jetsonNVIDIA深度學習教育機構 (DLI): Object detection with jetson
NVIDIA深度學習教育機構 (DLI): Object detection with jetson
NVIDIA Taiwan
 
The Next Chapter in the Sordid Love/Hate Relationship Between DBs and OSes by...
The Next Chapter in the Sordid Love/Hate Relationship Between DBs and OSes by...The Next Chapter in the Sordid Love/Hate Relationship Between DBs and OSes by...
The Next Chapter in the Sordid Love/Hate Relationship Between DBs and OSes by...
ScyllaDB
 
Trackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterTrackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity Calorimeter
Yousef Fadila
 
Machine Learning approaches at video compression
Machine Learning approaches at video compression Machine Learning approaches at video compression
Machine Learning approaches at video compression
Roberto Iacoviello
 
Lightweight DNN Processor Design (based on NVDLA)
Lightweight DNN Processor Design (based on NVDLA)Lightweight DNN Processor Design (based on NVDLA)
Lightweight DNN Processor Design (based on NVDLA)
Shien-Chun Luo
 
Deep Learning for Computer Vision: Memory usage and computational considerati...
Deep Learning for Computer Vision: Memory usage and computational considerati...Deep Learning for Computer Vision: Memory usage and computational considerati...
Deep Learning for Computer Vision: Memory usage and computational considerati...
Universitat Politècnica de Catalunya
 
Introduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNetIntroduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNet
KrishnakoumarC
 
Once-for-All: Train One Network and Specialize it for Efficient Deployment
 Once-for-All: Train One Network and Specialize it for Efficient Deployment Once-for-All: Train One Network and Specialize it for Efficient Deployment
Once-for-All: Train One Network and Specialize it for Efficient Deployment
taeseon ryu
 
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
Codemotion
 
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...
(Paper note) Real time rgb-d camera relocalization via randomized ferns for k...
e8xu
 
Distributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learnedDistributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learned
Wee Hyong Tok
 
Convolutional Neural Network Models - Deep Learning
Convolutional Neural Network Models - Deep LearningConvolutional Neural Network Models - Deep Learning
Convolutional Neural Network Models - Deep Learning
Mohamed Loey
 
Intelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiencyIntelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiency
Qualcomm Research
 
YOLO9000 - PR023
YOLO9000 - PR023YOLO9000 - PR023
YOLO9000 - PR023
Jinwon Lee
 
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC SystemsImproving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
HPCC Systems
 
CNN, Deep Learning ResNet_30_Slide_Presentation.pptx
CNN, Deep Learning ResNet_30_Slide_Presentation.pptxCNN, Deep Learning ResNet_30_Slide_Presentation.pptx
CNN, Deep Learning ResNet_30_Slide_Presentation.pptx
OnUrTipsIncorporatio
 
Salt Identification Challenge
Salt Identification ChallengeSalt Identification Challenge
Salt Identification Challenge
kenluck2001
 
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methodsA brief introduction to recent segmentation methods
A brief introduction to recent segmentation methods
Shunta Saito
 
NVIDIA深度學習教育機構 (DLI): Object detection with jetson
NVIDIA深度學習教育機構 (DLI): Object detection with jetsonNVIDIA深度學習教育機構 (DLI): Object detection with jetson
NVIDIA深度學習教育機構 (DLI): Object detection with jetson
NVIDIA Taiwan
 
The Next Chapter in the Sordid Love/Hate Relationship Between DBs and OSes by...
The Next Chapter in the Sordid Love/Hate Relationship Between DBs and OSes by...The Next Chapter in the Sordid Love/Hate Relationship Between DBs and OSes by...
The Next Chapter in the Sordid Love/Hate Relationship Between DBs and OSes by...
ScyllaDB
 
Trackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterTrackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity Calorimeter
Yousef Fadila
 
Machine Learning approaches at video compression
Machine Learning approaches at video compression Machine Learning approaches at video compression
Machine Learning approaches at video compression
Roberto Iacoviello
 
Ad

More from Ashis Kumar Chanda (20)

Word 2 vector
Word 2 vectorWord 2 vector
Word 2 vector
Ashis Kumar Chanda
 
Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
Ashis Kumar Chanda
 
03. Agile Development
03. Agile Development03. Agile Development
03. Agile Development
Ashis Kumar Chanda
 
Software Cost Estimation
Software Cost EstimationSoftware Cost Estimation
Software Cost Estimation
Ashis Kumar Chanda
 
Risk Management
Risk ManagementRisk Management
Risk Management
Ashis Kumar Chanda
 
Project Management
Project ManagementProject Management
Project Management
Ashis Kumar Chanda
 
MVC
MVCMVC
MVC
Ashis Kumar Chanda
 
Requirements engineering
Requirements engineeringRequirements engineering
Requirements engineering
Ashis Kumar Chanda
 
4. UML
4. UML4. UML
4. UML
Ashis Kumar Chanda
 
2. Software process
2. Software process2. Software process
2. Software process
Ashis Kumar Chanda
 
1. Introduction
1. Introduction1. Introduction
1. Introduction
Ashis Kumar Chanda
 
Periodic pattern mining
Periodic pattern miningPeriodic pattern mining
Periodic pattern mining
Ashis Kumar Chanda
 
FPPM algorithm
FPPM algorithmFPPM algorithm
FPPM algorithm
Ashis Kumar Chanda
 
Secure software design
Secure software designSecure software design
Secure software design
Ashis Kumar Chanda
 
Sequential logic circuit optimization
Sequential logic circuit optimizationSequential logic circuit optimization
Sequential logic circuit optimization
Ashis Kumar Chanda
 
Introduction to CS
Introduction to CSIntroduction to CS
Introduction to CS
Ashis Kumar Chanda
 
Iterative deepening search
Iterative deepening searchIterative deepening search
Iterative deepening search
Ashis Kumar Chanda
 
CloudBus
CloudBusCloudBus
CloudBus
Ashis Kumar Chanda
 
Linear Machine Decision Tree
Linear Machine Decision TreeLinear Machine Decision Tree
Linear Machine Decision Tree
Ashis Kumar Chanda
 
Logical Operations on BDD
Logical Operations on BDDLogical Operations on BDD
Logical Operations on BDD
Ashis Kumar Chanda
 
Ad

Recently uploaded (20)

Water Industry Process Automation & Control Monthly May 2025
Water Industry Process Automation & Control Monthly May 2025Water Industry Process Automation & Control Monthly May 2025
Water Industry Process Automation & Control Monthly May 2025
Water Industry Process Automation & Control
 
Control Methods of Noise Pollutions.pptx
Control Methods of Noise Pollutions.pptxControl Methods of Noise Pollutions.pptx
Control Methods of Noise Pollutions.pptx
vvsasane
 
Slide share PPT of SOx control technologies.pptx
Slide share PPT of SOx control technologies.pptxSlide share PPT of SOx control technologies.pptx
Slide share PPT of SOx control technologies.pptx
vvsasane
 
Machine foundation notes for civil engineering students
Machine foundation notes for civil engineering studentsMachine foundation notes for civil engineering students
Machine foundation notes for civil engineering students
DYPCET
 
JRR Tolkien’s Lord of the Rings: Was It Influenced by Nordic Mythology, Homer...
JRR Tolkien’s Lord of the Rings: Was It Influenced by Nordic Mythology, Homer...JRR Tolkien’s Lord of the Rings: Was It Influenced by Nordic Mythology, Homer...
JRR Tolkien’s Lord of the Rings: Was It Influenced by Nordic Mythology, Homer...
Reflections on Morality, Philosophy, and History
 
David Boutry - Specializes In AWS, Microservices And Python.pdf
David Boutry - Specializes In AWS, Microservices And Python.pdfDavid Boutry - Specializes In AWS, Microservices And Python.pdf
David Boutry - Specializes In AWS, Microservices And Python.pdf
David Boutry
 
Modelling of Concrete Compressive Strength Admixed with GGBFS Using Gene Expr...
Modelling of Concrete Compressive Strength Admixed with GGBFS Using Gene Expr...Modelling of Concrete Compressive Strength Admixed with GGBFS Using Gene Expr...
Modelling of Concrete Compressive Strength Admixed with GGBFS Using Gene Expr...
Journal of Soft Computing in Civil Engineering
 
22PCOAM16 ML Unit 3 Full notes PDF & QB.pdf
22PCOAM16 ML Unit 3 Full notes PDF & QB.pdf22PCOAM16 ML Unit 3 Full notes PDF & QB.pdf
22PCOAM16 ML Unit 3 Full notes PDF & QB.pdf
Guru Nanak Technical Institutions
 
01.คุณลักษณะเฉพาะของอุปกรณ์_pagenumber.pdf
01.คุณลักษณะเฉพาะของอุปกรณ์_pagenumber.pdf01.คุณลักษณะเฉพาะของอุปกรณ์_pagenumber.pdf
01.คุณลักษณะเฉพาะของอุปกรณ์_pagenumber.pdf
PawachMetharattanara
 
Mode-Wise Corridor Level Travel-Time Estimation Using Machine Learning Models
Mode-Wise Corridor Level Travel-Time Estimation Using Machine Learning ModelsMode-Wise Corridor Level Travel-Time Estimation Using Machine Learning Models
Mode-Wise Corridor Level Travel-Time Estimation Using Machine Learning Models
Journal of Soft Computing in Civil Engineering
 
hypermedia_system_revisit_roy_fielding .
hypermedia_system_revisit_roy_fielding .hypermedia_system_revisit_roy_fielding .
hypermedia_system_revisit_roy_fielding .
NABLAS株式会社
 
Frontend Architecture Diagram/Guide For Frontend Engineers
Frontend Architecture Diagram/Guide For Frontend EngineersFrontend Architecture Diagram/Guide For Frontend Engineers
Frontend Architecture Diagram/Guide For Frontend Engineers
Michael Hertzberg
 
SICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introductionSICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introduction
fabienklr
 
Jacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia - Excels In Optimizing Software ApplicationsJacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia
 
How to Build a Desktop Weather Station Using ESP32 and E-ink Display
How to Build a Desktop Weather Station Using ESP32 and E-ink DisplayHow to Build a Desktop Weather Station Using ESP32 and E-ink Display
How to Build a Desktop Weather Station Using ESP32 and E-ink Display
CircuitDigest
 
Automatic Quality Assessment for Speech and Beyond
Automatic Quality Assessment for Speech and BeyondAutomatic Quality Assessment for Speech and Beyond
Automatic Quality Assessment for Speech and Beyond
NU_I_TODALAB
 
Evonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdfEvonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdf
szhang13
 
Machine Learning basics POWERPOINT PRESENETATION
Machine Learning basics POWERPOINT PRESENETATIONMachine Learning basics POWERPOINT PRESENETATION
Machine Learning basics POWERPOINT PRESENETATION
DarrinBright1
 
Transport modelling at SBB, presentation at EPFL in 2025
Transport modelling at SBB, presentation at EPFL in 2025Transport modelling at SBB, presentation at EPFL in 2025
Transport modelling at SBB, presentation at EPFL in 2025
Antonin Danalet
 
Little Known Ways To 3 Best sites to Buy Linkedin Accounts.pdf
Little Known Ways To 3 Best sites to Buy Linkedin Accounts.pdfLittle Known Ways To 3 Best sites to Buy Linkedin Accounts.pdf
Little Known Ways To 3 Best sites to Buy Linkedin Accounts.pdf
gori42199
 
Control Methods of Noise Pollutions.pptx
Control Methods of Noise Pollutions.pptxControl Methods of Noise Pollutions.pptx
Control Methods of Noise Pollutions.pptx
vvsasane
 
Slide share PPT of SOx control technologies.pptx
Slide share PPT of SOx control technologies.pptxSlide share PPT of SOx control technologies.pptx
Slide share PPT of SOx control technologies.pptx
vvsasane
 
Machine foundation notes for civil engineering students
Machine foundation notes for civil engineering studentsMachine foundation notes for civil engineering students
Machine foundation notes for civil engineering students
DYPCET
 
David Boutry - Specializes In AWS, Microservices And Python.pdf
David Boutry - Specializes In AWS, Microservices And Python.pdfDavid Boutry - Specializes In AWS, Microservices And Python.pdf
David Boutry - Specializes In AWS, Microservices And Python.pdf
David Boutry
 
01.คุณลักษณะเฉพาะของอุปกรณ์_pagenumber.pdf
01.คุณลักษณะเฉพาะของอุปกรณ์_pagenumber.pdf01.คุณลักษณะเฉพาะของอุปกรณ์_pagenumber.pdf
01.คุณลักษณะเฉพาะของอุปกรณ์_pagenumber.pdf
PawachMetharattanara
 
hypermedia_system_revisit_roy_fielding .
hypermedia_system_revisit_roy_fielding .hypermedia_system_revisit_roy_fielding .
hypermedia_system_revisit_roy_fielding .
NABLAS株式会社
 
Frontend Architecture Diagram/Guide For Frontend Engineers
Frontend Architecture Diagram/Guide For Frontend EngineersFrontend Architecture Diagram/Guide For Frontend Engineers
Frontend Architecture Diagram/Guide For Frontend Engineers
Michael Hertzberg
 
SICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introductionSICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introduction
fabienklr
 
Jacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia - Excels In Optimizing Software ApplicationsJacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia
 
How to Build a Desktop Weather Station Using ESP32 and E-ink Display
How to Build a Desktop Weather Station Using ESP32 and E-ink DisplayHow to Build a Desktop Weather Station Using ESP32 and E-ink Display
How to Build a Desktop Weather Station Using ESP32 and E-ink Display
CircuitDigest
 
Automatic Quality Assessment for Speech and Beyond
Automatic Quality Assessment for Speech and BeyondAutomatic Quality Assessment for Speech and Beyond
Automatic Quality Assessment for Speech and Beyond
NU_I_TODALAB
 
Evonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdfEvonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdf
szhang13
 
Machine Learning basics POWERPOINT PRESENETATION
Machine Learning basics POWERPOINT PRESENETATIONMachine Learning basics POWERPOINT PRESENETATION
Machine Learning basics POWERPOINT PRESENETATION
DarrinBright1
 
Transport modelling at SBB, presentation at EPFL in 2025
Transport modelling at SBB, presentation at EPFL in 2025Transport modelling at SBB, presentation at EPFL in 2025
Transport modelling at SBB, presentation at EPFL in 2025
Antonin Danalet
 
Little Known Ways To 3 Best sites to Buy Linkedin Accounts.pdf
Little Known Ways To 3 Best sites to Buy Linkedin Accounts.pdfLittle Known Ways To 3 Best sites to Buy Linkedin Accounts.pdf
Little Known Ways To 3 Best sites to Buy Linkedin Accounts.pdf
gori42199
 

Multi-class Image Classification using deep convolutional networks on extremely large dataset

  • 1. Multi-class Image Classification using Deep Convolutional Networks on extremely large dataset Marija Stanojevic Ashis Chanda CIS Department, Temple University 5th December 2017
  • 2. Problem Definition ❖ Kaggle competition: Cdiscount’s Image Classification Challenge ❖ Classify user posted images into 5270 categories ❖ Challenges of dataset: ➢ Huge amount of data and categories ➢ Background clutter (objects blend into environment) ➢ Viewpoint and data scale variation ➢ Deformation ➢ Occlusion ➢ Illumination conditions ❖ Deep learning models seem suitable Data sets Size # products Train data 58.2 GB 7,069,896 Test data 14.5 GB 1,768,182 Category # Labels Category1 49 Category2 483 Category3 5270
  • 4. Dataset: Categories A sample list of product categories:
  • 5. Our used models ● CNN ● DenseNet ● ResNet ● ResNext ● Wide-res Net
  • 6. Models: CNN ❖ Our CNN ➢ 3 Conv, 2 FC layers, 3 Pool layers ➢ thinner layers ➢ downsampling three times ❖ Tried to use for three types of categories
  • 7. ❖ l th layer has connection with all previous layers (l -1) ❖ Normally, each dense block contains 40, 100, 160 layers ❖ We used 3 blocks, 3 layers. Total connection: l x (l -1) /2 = 15 ❖ Each layer has BN, ReLU, Conv ❖ No transition layer (downsampling), but used dropout Models: DenseNet
  • 8. Models: ResNet ❖ 34, 50, 101, 152 layers ❖ Residual block example ❖ Batch normalization, ReLu ❖ Our residual network ➢ thinner layers (25 conv + 4 pool + fc) ➢ downsampling after each block ➢ no bottleneck
  • 9. ❖ Uses residual block as bases ❖ Block width is divided in k parts which learn separately ❖ Blocks are wider than in resNet ❖ Distinct cardinalities ❖ 29, 50 and 101 layers ❖ Our resNext: ➢ thinner layers ➢ downsampling after each block ➢ less layers (19 conv + 4 pool + fc) ➢ cardinality 32 always - best results according to paper Models: ResNext
  • 10. ❖ Uses residual block as bases ❖ Block is k times wider (k=1, 2, 4, 8, 10, 12) ❖ Dropout beside batch normalization ❖ 16, 22, 28, 40 layers ❖ Our wideResNet: ➢ k = 2 - doesn’t increase much number of parameters, but shows biggest improvement in original results ➢ dropout keep = 0.7 - best in original results ➢ less layers (13 conv + 4 pool + fc) Models: WideResNet
  • 11. Challenges in dataset ❖ 5.6 TB after splitting data ❖ Train data (707 Batches) & Test data (177 Batches) ❖ Smaller batch: more reads and writes ❖ Bigger batch: memory error ❖ Each batch has 10,000 products ~ 20,000 images ❖ Cross-validation data: 707 products ~ 1500 images
  • 12. Challenges in implementation ❖ Used owlsnesttwo high performance computing system ❖ GPU: NVIDIA Tesla P100 PCIe 12GB ❖ Only two jobs allowed in parallel ❖ Implemented with tensorflow and tflearn in python ❖ Network ➢ Complexity in debugging ➢ Bigger network: memory error => tuning network ➢ Thinner and less layers ➢ Small number of epoch
  • 13. Baseline (CNN) Result ❖ CNN: (10 Epochs, 10 Batches) ❖ CNN: (50 Epochs, 50 Batches) (in progress) Label # class Accuracy Error Time Category 1 49 52% 10.7 4 hours Category 2 483 47% 11.68 5 hours Category 3 5270 32% 4.89 11 hours Category 1 49 12% (1 epoch) 3.42 20 hours
  • 14. Experimental Result (category 3) Model # Batch # Epoch Accuracy Error Time DenseNet 50/707 1 (running) 4% 9.83 10 days ResNet 50/707 10 34% 3.94489 5.69 hours ResNext 50/707 10 28.9% 4.51031 16.8 hours WideResNext 50/707 10 41.93% 3.53318 6.11 hours ResNet 707/707 3 (running) 36.69% 3.84427 3.35 days ResNext 707/707 1 (running) 28.47% 4.50394 9.90 days WideResNext 707/707 2 (running) 40.51% 3.49804 3.60 days
  • 15. We are here now
  • 16. Experimental results: Error in classifying Mobile cover bag Mobile Case Mobile Laptop cover bag Mobile film protector Mobile Case
  • 17. Conclusions and Future Work ❖ Baseline: CNN; Proposed: resNet, resNext, denseNet, wideResNet ❖ All proposed networks have similar number of parameters ❖ wideResNet performs the best ❖ resNext gave worst results and is 3 times slower than resNet ❖ DenseNet requires GPU with huge memory ❖ Requires a lot of time, huge memory and fast computational resources ❖ Number of epochs has to be 70+ ❖ Future: Submit result to Kaggle competition
  翻译: