Projects

Course Projects

Ph.D Course Projects

  • Visual Question Answering, (Computer Vision), 2016.[Proposal][Poster]

  • Object Recognition and Localization, (Selected Topics of Image Processing), 2016.[Presentation][Report]

  • Direction of Arrival Based Spatial Covariance Model For Blind Source Separation, (Speech Signal Processing), 2016.[Presentation]

  • Robust Video Stabilization Based on Particle Filter Tracking of Projected Camera Motion,(Video Processing), 2016.[Report][Demo-Input][Demo-Output]

M.Tech Course Projects

  • Design and implementing Run length encoding, Barrel Shifter, floating point adder & Bus behavior using VHDL and Verilog, (VLSI Design Lab), 2011.

  • SENSE: Sensitive Encoding technique for Fast MRI using Back Projection, (Medical Image Processing), 2010.

  • An Semi-Autonomous, External Command Reading White line Follower Robot, (Embedded System-Robotics), 2010.[Report]

  • Adaptive Beam forming using microphone array for hands free Telephony with the help of generalized side lobe technique, (Adaptive Signal Processing), 2010.[Report]

  • Detection of Duplicate Forgery in Handwritten Signature using Statistical DWT & EDM, (Wavelet Transform), 2010.[Report][Poster]

  • Frequency Code(LFM) and Phase code(Barker code) Pulse Compression Techniques in Mono Pulse Radar, (Digital Signal Processing), 2009.[Report]

Industrial Workshop and Summer School

  • Summer School on Machine Learning using Deep Learning(optimization for DL,GAN, VAE, DL for RL and game theory), IIIT-H, July, 2017.

  • Summer School on Advance Computer Vision using Deep Learning(DL)(DL for vision and language(Caption, VQA), DL for videos, object detection, semantic segmenta-tion,Domain Adaption, and advances in 3D), IIIT-H, July, 2017.

  • Mysore Park Workshop on Vision, Language and AI(Video Caption, guided LSTM,GAN, Adversarial auto-encoders, reinforcement learning, deep contextual mod-els), VLAI 2016, Mysore, Dec, 2016.

  • Summer School on computer vision using Deep Learning(CNN, RNN, Auto-encoder,optimization for DL, Symbolic deep learning & face, pose and Egocentric actionrecognition, model compression), IIIT-H, July, 2016.

  • Audio Engineering(Acoustics, Recording, Broadcasting Technology, Surround Sound,Microphones& Speakers) & Audio Post Processing(Harman International), 2012.

  • Software Architecture of DaVinci Multimedia Processor-DM6437(APL, SPL, IOL) and Video Processing Subsystem(VPFE, VPBE) (IIT Bombay, India), 2011.

Poster Presentation

  • Multimodal Differential Network for Visual Question Generation, EMNLP 2018, Brussels, Belgium, 2018.

  • Learning Semantic Sentence Embeddings using Pair-wise Discriminator, COLING 2018, Santa Fe, New Mexico, USA, 2018.

  • Differential Attention for Visual Question Answering, CVPR , Salt Lake City, Utah, USA, 2018.

  • Visual Question Answering, Summer School on Advance Computer Vision, IIIT-H, July, 2017.

  • Visual Question Answering, (Computer Vision), 2016.

Award

  • Received Student Volunteer Award from EMNLP and Conference Travel Grant from Microsoft India for EMNLP 2018.

  • Received Student Volunteer Award from CVF and Conference Travel Grant from Google India for CVPR 2018.

  • Received Conference Travel Grant from IIT Kanpur for Coling 2018.

  • Selected in Quiz competition in Deep learning summer school for vision, IIITH,2017

  • Selected in Quiz competition in Deep learning summer school for ML, IIITH,2017

Industrial Seminars

  • MPEG-2 Transport Stream Standard(ISO/IEC-13818-1)– PAT, PMT, Descriptor, Section, TS, PES and ES information (Samsung R&D), 2014.

  • ATSC System Information Standard–A/53 part-1, A/65 and CEA-708,608 for Close Caption Decoder(Samsung R&D), 2014.

  • DVB Service Information Standard –EN 300468 and EN-300743 Subtitle Decoder (Samsung R&D), 2014.

  • Digital Audio Processing–Audio Representation, Compression, Microphones, and Speakers module and Audio post processing (Samsung R&D), 2013.

  • Forward Error Correction Techniques– Uneven Length Protection(Samsung R&D), 2013.

  • DSP algorithm and Filter Design– FIR/IIR digital filter and transform technique(DFT, DCT, DST, FFT and Wavelet) (Harman International) 2012.