Optimizing network performance through AI-driven media-to-text conversion

Authors

  • Ali Hussein Alnooh University of Information Technology and Communications, Iraq
  • Nawar A. Sultan Northern Technical University, Iraq
  • Ali Yasir Kuti University of Information Technology and Communications, Iraq

DOI:

https://doi.org/10.37868/sei.v8i1.id793

Abstract

One of the most challenging issues in networks and cloud computing is the overhead caused by the exchanged data. Several approaches in the literature have been proposed to mitigate this issue. However, there is still a lack of innovative methods and techniques to reduce the network's overhead. Hence, this work proposes an AI-driven method that converts media data (e.g., images, audio, or videos) into text to enhance the overall network performance. The proposed method is evaluated based on bandwidth savings, throughput, and latency. The findings demonstrate that the proposed method achieves a 98% bandwidth reduction and a 3.6 times higher throughput, with high accuracy (BLEU-4 > 0.78 for captions, WER < 12% for speech). Moreover, the statistical validation shows a significant improvement in latency (150ms for audio and 950ms for video) and a packet loss rate of 0.3%. Finally, the proposed method is considered adaptable to IoT, edge computing, and cloud systems due to its cost-effectiveness.

Published

2026-04-09

How to Cite

[1]
A. H. Alnooh, N. A. Sultan, and A. Y. Kuti, “Optimizing network performance through AI-driven media-to-text conversion”, Sustainable Engineering and Innovation, vol. 8, no. 1, pp. 139-150, Apr. 2026.

Issue

Section

Articles