IJFANS International Journal of Food and Nutritional Sciences

ISSN Print: 2319-1775, Online: 2320-7876

Recognizing sign language from multiple camera angles using 2D video skeleton data


Ch. Raghava Prasad
DOI: 10.48047/IJFANS/11/S6/004

Abstract

In this study, we propose a view-oriented feature fusion (VOFF) method for multi-stream CNNs. A CNN model is trained on nine different camera perspectives, which are grouped into three subsets according to the camera's position relative to the action: far left, middle, and far right. After each subset is processed by its dense network, the three subsets are fused over a common set of features, and a prediction is made by accumulating the scores of the three softmax layers. The results demonstrate that fusing spatial features across viewpoints yields a strongly discriminative view feature vector. However, while this fusion technique produces a good view feature distribution, it fails to distinguish signs that appear visually identical across several views. To address this difficulty, we investigated a contrastive network with triplet loss embedding (CNTLE). In this framework, the viewpoints are paired into a support set of positive-class and negative-class perspectives, and the CNN networks are trained with a global cross-entropy loss together with view-specific triplet losses. Pairing views of inter-class signs with similar physical appearance as negative examples mitigates the model's earlier shortcoming, allowing it to produce satisfactory view-invariant features for classification.
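As a rough illustration of the architecture described above, the following PyTorch sketch builds three view-group streams whose softmax scores are accumulated for prediction (the VOFF fusion), and a combined objective pairing a global cross-entropy loss with a view-specific triplet loss (in the spirit of CNTLE). The stream depth, the skeleton-map input shape, the class count, the triplet margin, and the loss weight `alpha` are illustrative assumptions, not the paper's exact configuration.

```python
# A minimal sketch of the VOFF/CNTLE ideas above, assuming PyTorch. Network
# sizes, input shapes, and loss hyperparameters are illustrative assumptions.
import torch
import torch.nn as nn


class ViewStream(nn.Module):
    """One CNN stream for a view group (far left, middle, or far right)."""

    def __init__(self, in_channels, num_classes):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # pool to a 64-d view embedding
        )
        self.dense = nn.Linear(64, num_classes)

    def forward(self, x):
        emb = self.features(x).flatten(1)  # view-specific embedding
        return self.dense(emb), emb        # class logits and embedding


class VOFF(nn.Module):
    """Fuses three view-group streams by accumulating their softmax scores."""

    def __init__(self, in_channels, num_classes):
        super().__init__()
        self.streams = nn.ModuleList(
            ViewStream(in_channels, num_classes) for _ in range(3)
        )

    def forward(self, views):
        # views: list of three tensors, one batch per view group
        outs = [stream(v) for stream, v in zip(self.streams, views)]
        logits = torch.stack([l for l, _ in outs])       # (3, B, C)
        fused = torch.softmax(logits, dim=2).sum(dim=0)  # accumulated scores
        return fused, logits.sum(dim=0), [e for _, e in outs]


# Combined objective: a global cross-entropy loss (here applied to the summed
# logits, an assumption) plus a view-specific triplet loss whose negative is
# drawn from a visually similar sign of another class.
xent = nn.CrossEntropyLoss()
triplet = nn.TripletMarginLoss(margin=1.0)

def cntle_loss(global_logits, labels, anchor, positive, negative, alpha=0.5):
    return xent(global_logits, labels) + alpha * triplet(anchor, positive, negative)


model = VOFF(in_channels=3, num_classes=50)
views = [torch.randn(4, 3, 64, 64) for _ in range(3)]  # dummy skeleton maps
fused_scores, global_logits, embeddings = model(views)
prediction = fused_scores.argmax(dim=1)
```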
