Video and Text Summarization Using VDAN and RNN

  • Unique Paper ID: 152248
  • Volume: 8
  • Issue: 2
  • PageNo: 780-786
  • Abstract:
  • The main intent of this project is to develop a video and text summarizer. The videos in these days are so quite long. It is difficult for people with hectic work schedule to find time to watch the long videos. Thus a summarizer will help people in getting the gist immediately. The video summarization is done with the help of Visually-Guided Document Attention Network (VDAN).The motive of this network is to extract the textual and visual features. The extraction of visual features is done with the help of Convolutional Neural Network(CNN).The extraction of textual features is done with the help of document level encoding. It also contains Gated Recurrent Unit (GRU).Based on the visual and textual features extracted, the agent decides the corresponding action. The three sets of actions are accelerate, decelerate and do nothing. The text summarization part is done with the help of Recurrent Neural Network. It follows an encoder-decoder architecture. It also makes use of Long Short Term Memory (LSTM) to keep track of the previous observations. Thus at the end the summarized video and text are available.

Cite This Article

  • ISSN: 2349-6002
  • Volume: 8
  • Issue: 2
  • PageNo: 780-786

Video and Text Summarization Using VDAN and RNN

Related Articles