Review on Scene Semantics Extraction for Decision Making System in Autonomous Vehicles

Yuvraj Bapu Hembade

doi:10.26483/ijarcs.v13i4.6882

Published: 2022-08-21

Keywords:

Autonomous Vehicles, Deep Convolutional Neural Networks, Scene Semantics, Object Detection.

Yuvraj Bapu Hembade

Savitribai Phule Pune University

https://orcid.org/0000-0003-0583-7224

Abstract

Autonomous vehicles (AVs) are widely expected to supersede traditional manual driving in the coming years and are among the most anticipated developments in the automotive industry. This shift requires a Decision Making System (DMS) that enables AVs to interpret the real-time situations around them intuitively. Recognizing street scenes and extracting the relevant semantics from them is a particularly challenging task. Image classification and object detection techniques based on Deep Convolutional Neural Networks (DCNNs) are therefore expected to play a vital role in most methodologies designed for scene semantics extraction. Based on the extracted scene semantics, the DMS actuates the devices that control the vehicle's speed and steering angle. Extracting information from road scene images that covers all the aspects needed for such decisions is consequently of major importance to the overall performance of AVs.
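As a rough illustration of the last step described in the abstract, the sketch below maps extracted scene semantics to a speed and steering command. It is not the method of the reviewed papers: the `SceneSemantics` and `Command` types, the `decide` function, the class names, and every threshold and speed cap are hypothetical values chosen only for this example.

```python
from dataclasses import dataclass, field


@dataclass
class SceneSemantics:
    """Hypothetical summary of one road-scene frame."""
    scene_label: str                             # image classification output, e.g. "urban"
    objects: list = field(default_factory=list)  # object detections as (class_name, distance_m)
    lane_offset_m: float = 0.0                   # positive = vehicle is right of lane centre


@dataclass
class Command:
    target_speed_kmh: float
    steering_angle_deg: float


# Illustrative speed caps per scene class (assumed values, not from the paper).
SPEED_CAP_KMH = {"highway": 80.0, "urban": 40.0, "intersection": 20.0}


def decide(sem: SceneSemantics) -> Command:
    """Rule-based stand-in for a DMS: scene semantics in, actuation targets out."""
    # Start from the cap for the recognized scene; fall back to a cautious default.
    speed = SPEED_CAP_KMH.get(sem.scene_label, 30.0)

    for name, distance_m in sem.objects:
        if name in {"pedestrian", "cyclist"} and distance_m < 15.0:
            speed = 0.0                  # stop for vulnerable road users nearby
        elif name == "vehicle" and distance_m < 30.0:
            speed = min(speed, 20.0)     # slow down behind a close vehicle

    # Proportional steering back toward the lane centre, clipped to +/- 10 degrees.
    angle = max(-10.0, min(10.0, -5.0 * sem.lane_offset_m))
    return Command(target_speed_kmh=speed, steering_angle_deg=angle)


if __name__ == "__main__":
    frame = SceneSemantics("urban", [("vehicle", 25.0)], lane_offset_m=0.5)
    print(decide(frame))
```

A real DMS would likely learn or calibrate such rules rather than hard-code them; the point here is only the data flow from classification and detection outputs to speed and steering targets.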

Issue

Vol. 13 No. 4 (2022): July-August 2022

Section

Articles

COPYRIGHT

Submission of a manuscript implies: that the work described has not been published before, that it is not under consideration for publication elsewhere; that if and when the manuscript is accepted for publication, the authors agree to automatic transfer of the copyright to the publisher.

Authors who publish with this journal agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
The journal allows the author(s) to retain publishing rights without restrictions.
The journal allows the author(s) to hold the copyright without restrictions.

