This document provides an overview of using convolutional neural networks (CNNs) for video concept detection. It discusses how CNNs can be used to extract features from keyframes of video shots for classification. CNNs pretrained on ImageNet are used to extract 4096-dimensional feature vectors from each keyframe. These features are then input to classifiers like support vector machines to predict the presence of semantic concepts in the video shots, based on a previously trained model. The proposed method aims to develop a high-performance video concept detection system using deep CNNs.