SlideShare a Scribd company logo
Scene Text Detection on Images
   using Cellular Automata
  Konstantinos Zagoris and Ioannis Pratikakis




      Image Processing and Multimedia Lab,
Department of Electrical and Computer Engineering,
 Democritus University of Thrace, Xanthi, Greece
    kzagoris@ee.duth.gr, ipratika@ee.duth.gr
Outline
 Introduction
 State of the Art
 Disadvantages
 Architecture of the proposed method
 Canny Edge Detector
 Coordinating Logic Filters (CLF)
 Proposed Cellular Automata Text Detection
  Method
 Evaluation and Experimental Results
Introduction
 Textual information in images or video constitutes
  a very rich source of high-level semantics for
  retrieval and indexing
 It can be acquired as scene text that was
  captured by a video or photo camera as part of a
  scene
 Text detection on natural scenes is still a hard
  task to solve
 Have very high computational cost
State of the Art
 Split in two categories: region-based and texture-
    based
    Region-based algorithms group pixels based on
    common characteristics
   Texture-based methods scan the image at
    different scales using a sliding window and
    classify text areas based on texture information.
   From another perspective, can be divided into
    heuristic-based and machine learning-based
    methods.
   Heuristic-based algorithms segment the image
    into small regions and then group them by some
    constraints
   Machine learning-based methods use directly
Disadvantages
 Many     parameters have to be estimated
  experimentally    condemns       them    to   data
  dependency and lack of generality
 When background is really complex, they
  become computationally expensive.
 Texture-based        techniques cannot catch
  satisfactory text with size bigger of the sliding
  window.
 An increase of the window make these methods
  quite costly. In addition, they still use empirical
  thresholds on specific features therefore they lack
  adaptability.
Proposed Method
 Address the scene text detection problem by
  modeling texture into cellular automata (CA)
  context
 Replace costly image processing operations with
  their equivalent cellular operations
 Eliminate most limitations, such as the empirical
  thresholds and heavy computational procedures
Architecture of the proposed method
Original Image

          Canny Edge
             Map

                 Logical OR
                                          Cellular Automata
                         Logical AND


    Coordinating Logic           Logical OR
        Filters                        Majority State
                                           Rule
                                                    Edge
                                                 Projection
                                                  Filtering
                                                        Final Text
Coordinating Logic Filters (CLF)
 execute coordinate logic operations among the
  pixels of the image
 The     CLF operations is similar to the
  morphological operations, achieving similar
  functionality
 morphology Dilation is the logical OR
 morphology Erosion is the logical AND
Canny Edge Detector
 Detection of the salient image edges
 Use Sobel masks
 thresholding and non-maxima suppression(low
  threshold equal to 20 and high threshold equal to
  100)
 The final edge map is a binarised image with the
  contour pixels set to one (white) and the
  remainder pixels equal to zero (black).
 This approach exploits the fact that text lines
  produce strong vertical edges horizontally aligned
  with a high density.
 gives us the opportunity to detect normal or
Canny Edge Detector
Proposed Cellular Automata
 The proposed CA is considered to be a 2-D lattice
  of cells where every pixel is represented by a cell.
 The CA grid width and height is defined by the
  edge image width and height
 Each cell have two states as the input image is
  binary.
 Taking advantage of the CA flexibility, the
  transition rules are changing and are applied in
  four consecutive steps resulting in four time steps
  CA evolution.
1st Step – Logical OR

1st Step – Logical OR
2nd Step – Logical AND

2nd Step Logical AND
3rd Step – Logical OR

3rd Step – Logical OR
Majority State Rule

4th Step - Majority State Rule
Edge Projection Filtering
 in the high edge density images, the method
  produces a number of false positives
 post-processing filtering is required in order to
  remove them
 filtered them based on horizontal and vertical
  projections
 Areas with mean horizontal and vertical
  projections below a threshold are discarded.
Edge Projection Filtering
Examples
Examples
Evaluation

Evaluation





    1. Wolf, C., Jolion, J.M.: Object count/area graphs for the evaluation of object
    detection and segmentation algorithms. International Journal on Document
    Analysis and Recognition 8(4), 280–296 (2006)
Experimental Results
  In order to showcase the advantages of our
   proposed method, we test it against a machine-
   learning edge based scene text detection system.
  We replace the CLF with the corresponding
   morphological operations (dilation and opening)
   and the majority state rule with the Support
   Vector Machines (SVMs) classifier
Method                   Recall   Precision   Harmonic
                                              Mean
Proposed CA-based        0.7942   0.7462      0.7652
method
Machine-learning based   0.7134   0.5234      0.6038
method
Experimental Results
Mean execution time of each of them for a set images
(15 total) in a Intel Core 2 Quad CPU Q9550
(2.83GHz) machine.

Method                    Mean Execution Time
                          (sec)
Proposed CA-based         2.75 sec
method
Machine-learning based    5.96 sec
method
Conclusions
 A method based on the Cellular Automata was
  presented for the detection of scene text on
  natural images
 Initially, the Canny edge detector is employed in
  order to exposed the dominant edges on the
  image.
 Then a CA is used for the calculation of the
  candidate text areas. Its rules depend on
  Coordinating Logic Filters and on the majority
  state rule
 A post-processing technique based on edge
  projection analysis is employed for the high
  density edge images in order to eliminated the
  false positives.
Ευχαριστώ Πολφ!


  Thank You!
Ad

More Related Content

What's hot (20)

Self-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with SmoothingSelf-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with Smoothing
Priyanka Wagh
 
Self-organizing map
Self-organizing mapSelf-organizing map
Self-organizing map
Tarat Diloksawatdikul
 
Text extraction from images
Text extraction from imagesText extraction from images
Text extraction from images
Garby Baby
 
Enhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wildEnhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wild
Prerana Mukherjee
 
Hand Written Digit Classification
Hand Written Digit ClassificationHand Written Digit Classification
Hand Written Digit Classification
ijtsrd
 
Image classification with Deep Neural Networks
Image classification with Deep Neural NetworksImage classification with Deep Neural Networks
Image classification with Deep Neural Networks
Yogendra Tamang
 
201907 AutoML and Neural Architecture Search
201907 AutoML and Neural Architecture Search201907 AutoML and Neural Architecture Search
201907 AutoML and Neural Architecture Search
DaeJin Kim
 
IRJET- Object Detection using Hausdorff Distance
IRJET-  	  Object Detection using Hausdorff DistanceIRJET-  	  Object Detection using Hausdorff Distance
IRJET- Object Detection using Hausdorff Distance
IRJET Journal
 
Btv thesis defense_v1.02-final
Btv thesis defense_v1.02-finalBtv thesis defense_v1.02-final
Btv thesis defense_v1.02-final
Vinh Bui
 
C04741319
C04741319C04741319
C04741319
IOSR-JEN
 
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence MatrixSteganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix
CSCJournals
 
Kernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of movingKernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of moving
IAEME Publication
 
Introduction to Convolutional Neural Networks
Introduction to Convolutional Neural NetworksIntroduction to Convolutional Neural Networks
Introduction to Convolutional Neural Networks
ParrotAI
 
O017429398
O017429398O017429398
O017429398
IOSR Journals
 
Sefl Organizing Map
Sefl Organizing MapSefl Organizing Map
Sefl Organizing Map
Nguyen Van Chuc
 
Ijetcas14 527
Ijetcas14 527Ijetcas14 527
Ijetcas14 527
Iasir Journals
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNet
SungminYou
 
Sub1586
Sub1586Sub1586
Sub1586
International Journal of Science and Research (IJSR)
 
Pillar k means
Pillar k meansPillar k means
Pillar k means
swathi b
 
Understanding Convolutional Neural Networks
Understanding Convolutional Neural NetworksUnderstanding Convolutional Neural Networks
Understanding Convolutional Neural Networks
Jeremy Nixon
 
Self-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with SmoothingSelf-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with Smoothing
Priyanka Wagh
 
Text extraction from images
Text extraction from imagesText extraction from images
Text extraction from images
Garby Baby
 
Enhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wildEnhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wild
Prerana Mukherjee
 
Hand Written Digit Classification
Hand Written Digit ClassificationHand Written Digit Classification
Hand Written Digit Classification
ijtsrd
 
Image classification with Deep Neural Networks
Image classification with Deep Neural NetworksImage classification with Deep Neural Networks
Image classification with Deep Neural Networks
Yogendra Tamang
 
201907 AutoML and Neural Architecture Search
201907 AutoML and Neural Architecture Search201907 AutoML and Neural Architecture Search
201907 AutoML and Neural Architecture Search
DaeJin Kim
 
IRJET- Object Detection using Hausdorff Distance
IRJET-  	  Object Detection using Hausdorff DistanceIRJET-  	  Object Detection using Hausdorff Distance
IRJET- Object Detection using Hausdorff Distance
IRJET Journal
 
Btv thesis defense_v1.02-final
Btv thesis defense_v1.02-finalBtv thesis defense_v1.02-final
Btv thesis defense_v1.02-final
Vinh Bui
 
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence MatrixSteganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix
CSCJournals
 
Kernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of movingKernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of moving
IAEME Publication
 
Introduction to Convolutional Neural Networks
Introduction to Convolutional Neural NetworksIntroduction to Convolutional Neural Networks
Introduction to Convolutional Neural Networks
ParrotAI
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNet
SungminYou
 
Pillar k means
Pillar k meansPillar k means
Pillar k means
swathi b
 
Understanding Convolutional Neural Networks
Understanding Convolutional Neural NetworksUnderstanding Convolutional Neural Networks
Understanding Convolutional Neural Networks
Jeremy Nixon
 

Viewers also liked (18)

Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Konstantinos Zagoris
 
Svm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammogramsSvm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammograms
Konstantinos Zagoris
 
Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)
Konstantinos Zagoris
 
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases
Dynamic Two-Stage Image Retrieval from Large Multimodal DatabasesDynamic Two-Stage Image Retrieval from Large Multimodal Databases
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases
Konstantinos Zagoris
 
Text Detection and Recognition
Text Detection and RecognitionText Detection and Recognition
Text Detection and Recognition
Badruz Nasrin Basri
 
Query expansion based on visual content new
Query expansion based on visual content newQuery expansion based on visual content new
Query expansion based on visual content new
Lazaros Tsochatzidis
 
Presentation iwssip2012
Presentation iwssip2012Presentation iwssip2012
Presentation iwssip2012
Bernhard Quehl
 
Detecting text from natural images with Stroke Width Transform
Detecting text from natural images with Stroke Width TransformDetecting text from natural images with Stroke Width Transform
Detecting text from natural images with Stroke Width Transform
Pooja G N
 
Text Detection From Image
Text Detection From ImageText Detection From Image
Text Detection From Image
Arun Patel
 
Text detection and recognition from natural scenes
Text detection and recognition from natural scenesText detection and recognition from natural scenes
Text detection and recognition from natural scenes
hemanthmcqueen
 
Block Emulation and Computation in One-dimensional Cellular Automata: Breakin...
Block Emulation and Computation in One-dimensional Cellular Automata: Breakin...Block Emulation and Computation in One-dimensional Cellular Automata: Breakin...
Block Emulation and Computation in One-dimensional Cellular Automata: Breakin...
Jurgen Riedel
 
Automata Invasion
Automata InvasionAutomata Invasion
Automata Invasion
lucenerevolution
 
online payment system using Steganography and Visual cryptography
online payment system using Steganography and Visual cryptographyonline payment system using Steganography and Visual cryptography
online payment system using Steganography and Visual cryptography
Shahrukh Ali
 
Text Detection Strategies
Text Detection StrategiesText Detection Strategies
Text Detection Strategies
Anyline
 
Urban Land Cover Change Detection Analysis and Modelling Spatio-Temporal Grow...
Urban Land Cover Change Detection Analysis and Modelling Spatio-Temporal Grow...Urban Land Cover Change Detection Analysis and Modelling Spatio-Temporal Grow...
Urban Land Cover Change Detection Analysis and Modelling Spatio-Temporal Grow...
Bayes Ahmed
 
Visual Cryptography Industrial Training Report
Visual Cryptography Industrial Training ReportVisual Cryptography Industrial Training Report
Visual Cryptography Industrial Training Report
Mohit Kumar
 
Visual cryptography
Visual cryptographyVisual cryptography
Visual cryptography
Shahid Zargar
 
Finite automata
Finite automataFinite automata
Finite automata
Sutee Sudprasert
 
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Konstantinos Zagoris
 
Svm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammogramsSvm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammograms
Konstantinos Zagoris
 
Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)
Konstantinos Zagoris
 
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases
Dynamic Two-Stage Image Retrieval from Large Multimodal DatabasesDynamic Two-Stage Image Retrieval from Large Multimodal Databases
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases
Konstantinos Zagoris
 
Query expansion based on visual content new
Query expansion based on visual content newQuery expansion based on visual content new
Query expansion based on visual content new
Lazaros Tsochatzidis
 
Presentation iwssip2012
Presentation iwssip2012Presentation iwssip2012
Presentation iwssip2012
Bernhard Quehl
 
Detecting text from natural images with Stroke Width Transform
Detecting text from natural images with Stroke Width TransformDetecting text from natural images with Stroke Width Transform
Detecting text from natural images with Stroke Width Transform
Pooja G N
 
Text Detection From Image
Text Detection From ImageText Detection From Image
Text Detection From Image
Arun Patel
 
Text detection and recognition from natural scenes
Text detection and recognition from natural scenesText detection and recognition from natural scenes
Text detection and recognition from natural scenes
hemanthmcqueen
 
Block Emulation and Computation in One-dimensional Cellular Automata: Breakin...
Block Emulation and Computation in One-dimensional Cellular Automata: Breakin...Block Emulation and Computation in One-dimensional Cellular Automata: Breakin...
Block Emulation and Computation in One-dimensional Cellular Automata: Breakin...
Jurgen Riedel
 
online payment system using Steganography and Visual cryptography
online payment system using Steganography and Visual cryptographyonline payment system using Steganography and Visual cryptography
online payment system using Steganography and Visual cryptography
Shahrukh Ali
 
Text Detection Strategies
Text Detection StrategiesText Detection Strategies
Text Detection Strategies
Anyline
 
Urban Land Cover Change Detection Analysis and Modelling Spatio-Temporal Grow...
Urban Land Cover Change Detection Analysis and Modelling Spatio-Temporal Grow...Urban Land Cover Change Detection Analysis and Modelling Spatio-Temporal Grow...
Urban Land Cover Change Detection Analysis and Modelling Spatio-Temporal Grow...
Bayes Ahmed
 
Visual Cryptography Industrial Training Report
Visual Cryptography Industrial Training ReportVisual Cryptography Industrial Training Report
Visual Cryptography Industrial Training Report
Mohit Kumar
 
Ad

Similar to Scene Text Detection on Images using Cellular Automata (20)

CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMSCLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
sipij
 
CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMSCLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
sipij
 
Classification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization AlgorithmsClassification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization Algorithms
sipij
 
Classification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization AlgorithmsClassification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization Algorithms
sipij
 
Classification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization AlgorithmsClassification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization Algorithms
sipij
 
Classification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization AlgorithmsClassification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization Algorithms
sipij
 
Dj31514517
Dj31514517Dj31514517
Dj31514517
IJMER
 
Dj31514517
Dj31514517Dj31514517
Dj31514517
IJMER
 
IEEE 2014 Matlab Projects
IEEE 2014 Matlab ProjectsIEEE 2014 Matlab Projects
IEEE 2014 Matlab Projects
Vijay Karan
 
IEEE 2014 Matlab Projects
IEEE 2014 Matlab ProjectsIEEE 2014 Matlab Projects
IEEE 2014 Matlab Projects
Vijay Karan
 
TEXT DETECTION AND EXTRACTION FROM VIDEOS USING ANN BASED NETWORK
TEXT DETECTION AND EXTRACTION FROM VIDEOS USING ANN BASED NETWORKTEXT DETECTION AND EXTRACTION FROM VIDEOS USING ANN BASED NETWORK
TEXT DETECTION AND EXTRACTION FROM VIDEOS USING ANN BASED NETWORK
ijscai
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
theijes
 
Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w...
Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w...Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w...
Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w...
IOSR Journals
 
Enhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical RecordsEnhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical Records
csandit
 
JPM1407 Exposing Digital Image Forgeries by Illumination Color Classification
JPM1407   Exposing Digital Image Forgeries by Illumination Color ClassificationJPM1407   Exposing Digital Image Forgeries by Illumination Color Classification
JPM1407 Exposing Digital Image Forgeries by Illumination Color Classification
chennaijp
 
Segmentation of Images by using Fuzzy k-means clustering with ACO
Segmentation of Images by using Fuzzy k-means clustering with ACOSegmentation of Images by using Fuzzy k-means clustering with ACO
Segmentation of Images by using Fuzzy k-means clustering with ACO
IJTET Journal
 
Design and implementation of video tracking system based on camera field of view
Design and implementation of video tracking system based on camera field of viewDesign and implementation of video tracking system based on camera field of view
Design and implementation of video tracking system based on camera field of view
sipij
 
Matlab abstract 2016
Matlab abstract 2016Matlab abstract 2016
Matlab abstract 2016
Nexgen Technology
 
Effective Object Detection and Background Subtraction by using M.O.I
Effective Object Detection and Background Subtraction by using M.O.IEffective Object Detection and Background Subtraction by using M.O.I
Effective Object Detection and Background Subtraction by using M.O.I
IJMTST Journal
 
Matlab 2013 14 papers astract
Matlab 2013 14 papers astractMatlab 2013 14 papers astract
Matlab 2013 14 papers astract
IGEEKS TECHNOLOGIES
 
CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMSCLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
sipij
 
CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMSCLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
CLASSIFICATION AND COMPARISON OF LICENSE PLATES LOCALIZATION ALGORITHMS
sipij
 
Classification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization AlgorithmsClassification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization Algorithms
sipij
 
Classification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization AlgorithmsClassification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization Algorithms
sipij
 
Classification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization AlgorithmsClassification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization Algorithms
sipij
 
Classification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization AlgorithmsClassification and Comparison of License Plates Localization Algorithms
Classification and Comparison of License Plates Localization Algorithms
sipij
 
Dj31514517
Dj31514517Dj31514517
Dj31514517
IJMER
 
Dj31514517
Dj31514517Dj31514517
Dj31514517
IJMER
 
IEEE 2014 Matlab Projects
IEEE 2014 Matlab ProjectsIEEE 2014 Matlab Projects
IEEE 2014 Matlab Projects
Vijay Karan
 
IEEE 2014 Matlab Projects
IEEE 2014 Matlab ProjectsIEEE 2014 Matlab Projects
IEEE 2014 Matlab Projects
Vijay Karan
 
TEXT DETECTION AND EXTRACTION FROM VIDEOS USING ANN BASED NETWORK
TEXT DETECTION AND EXTRACTION FROM VIDEOS USING ANN BASED NETWORKTEXT DETECTION AND EXTRACTION FROM VIDEOS USING ANN BASED NETWORK
TEXT DETECTION AND EXTRACTION FROM VIDEOS USING ANN BASED NETWORK
ijscai
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
theijes
 
Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w...
Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w...Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w...
Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w...
IOSR Journals
 
Enhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical RecordsEnhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical Records
csandit
 
JPM1407 Exposing Digital Image Forgeries by Illumination Color Classification
JPM1407   Exposing Digital Image Forgeries by Illumination Color ClassificationJPM1407   Exposing Digital Image Forgeries by Illumination Color Classification
JPM1407 Exposing Digital Image Forgeries by Illumination Color Classification
chennaijp
 
Segmentation of Images by using Fuzzy k-means clustering with ACO
Segmentation of Images by using Fuzzy k-means clustering with ACOSegmentation of Images by using Fuzzy k-means clustering with ACO
Segmentation of Images by using Fuzzy k-means clustering with ACO
IJTET Journal
 
Design and implementation of video tracking system based on camera field of view
Design and implementation of video tracking system based on camera field of viewDesign and implementation of video tracking system based on camera field of view
Design and implementation of video tracking system based on camera field of view
sipij
 
Effective Object Detection and Background Subtraction by using M.O.I
Effective Object Detection and Background Subtraction by using M.O.IEffective Object Detection and Background Subtraction by using M.O.I
Effective Object Detection and Background Subtraction by using M.O.I
IJMTST Journal
 
Ad

Scene Text Detection on Images using Cellular Automata

  • 1. Scene Text Detection on Images using Cellular Automata Konstantinos Zagoris and Ioannis Pratikakis Image Processing and Multimedia Lab, Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi, Greece kzagoris@ee.duth.gr, ipratika@ee.duth.gr
  • 2. Outline  Introduction  State of the Art  Disadvantages  Architecture of the proposed method  Canny Edge Detector  Coordinating Logic Filters (CLF)  Proposed Cellular Automata Text Detection Method  Evaluation and Experimental Results
  • 3. Introduction  Textual information in images or video constitutes a very rich source of high-level semantics for retrieval and indexing  It can be acquired as scene text that was captured by a video or photo camera as part of a scene  Text detection on natural scenes is still a hard task to solve  Have very high computational cost
  • 4. State of the Art  Split in two categories: region-based and texture- based  Region-based algorithms group pixels based on common characteristics  Texture-based methods scan the image at different scales using a sliding window and classify text areas based on texture information.  From another perspective, can be divided into heuristic-based and machine learning-based methods.  Heuristic-based algorithms segment the image into small regions and then group them by some constraints  Machine learning-based methods use directly
  • 5. Disadvantages  Many parameters have to be estimated experimentally condemns them to data dependency and lack of generality  When background is really complex, they become computationally expensive.  Texture-based techniques cannot catch satisfactory text with size bigger of the sliding window.  An increase of the window make these methods quite costly. In addition, they still use empirical thresholds on specific features therefore they lack adaptability.
  • 6. Proposed Method  Address the scene text detection problem by modeling texture into cellular automata (CA) context  Replace costly image processing operations with their equivalent cellular operations  Eliminate most limitations, such as the empirical thresholds and heavy computational procedures
  • 7. Architecture of the proposed method Original Image Canny Edge Map Logical OR Cellular Automata Logical AND Coordinating Logic Logical OR Filters Majority State Rule Edge Projection Filtering Final Text
  • 8. Coordinating Logic Filters (CLF)  execute coordinate logic operations among the pixels of the image  The CLF operations is similar to the morphological operations, achieving similar functionality  morphology Dilation is the logical OR  morphology Erosion is the logical AND
  • 9. Canny Edge Detector  Detection of the salient image edges  Use Sobel masks  thresholding and non-maxima suppression(low threshold equal to 20 and high threshold equal to 100)  The final edge map is a binarised image with the contour pixels set to one (white) and the remainder pixels equal to zero (black).  This approach exploits the fact that text lines produce strong vertical edges horizontally aligned with a high density.  gives us the opportunity to detect normal or
  • 11. Proposed Cellular Automata  The proposed CA is considered to be a 2-D lattice of cells where every pixel is represented by a cell.  The CA grid width and height is defined by the edge image width and height  Each cell have two states as the input image is binary.  Taking advantage of the CA flexibility, the transition rules are changing and are applied in four consecutive steps resulting in four time steps CA evolution.
  • 12. 1st Step – Logical OR 
  • 13. 1st Step – Logical OR
  • 14. 2nd Step – Logical AND 
  • 16. 3rd Step – Logical OR 
  • 17. 3rd Step – Logical OR
  • 19. 4th Step - Majority State Rule
  • 20. Edge Projection Filtering  in the high edge density images, the method produces a number of false positives  post-processing filtering is required in order to remove them  filtered them based on horizontal and vertical projections  Areas with mean horizontal and vertical projections below a threshold are discarded.
  • 25. Evaluation  1. Wolf, C., Jolion, J.M.: Object count/area graphs for the evaluation of object detection and segmentation algorithms. International Journal on Document Analysis and Recognition 8(4), 280–296 (2006)
  • 26. Experimental Results  In order to showcase the advantages of our proposed method, we test it against a machine- learning edge based scene text detection system.  We replace the CLF with the corresponding morphological operations (dilation and opening) and the majority state rule with the Support Vector Machines (SVMs) classifier Method Recall Precision Harmonic Mean Proposed CA-based 0.7942 0.7462 0.7652 method Machine-learning based 0.7134 0.5234 0.6038 method
  • 27. Experimental Results Mean execution time of each of them for a set images (15 total) in a Intel Core 2 Quad CPU Q9550 (2.83GHz) machine. Method Mean Execution Time (sec) Proposed CA-based 2.75 sec method Machine-learning based 5.96 sec method
  • 28. Conclusions  A method based on the Cellular Automata was presented for the detection of scene text on natural images  Initially, the Canny edge detector is employed in order to exposed the dominant edges on the image.  Then a CA is used for the calculation of the candidate text areas. Its rules depend on Coordinating Logic Filters and on the majority state rule  A post-processing technique based on edge projection analysis is employed for the high density edge images in order to eliminated the false positives.
  翻译: