I. Introduction
Scientists were originally tasked with solving the challenge of language translation, and work on it has persisted for more than four decades. MT's current condition was the result of years of collaboration between linguists and computer scientists. First, MT relied on dictionary matching approaches, but rule-based systems have since taken their place. In recent years, SMT has been widely adopted in most Machine Translation Systems (MTS). Words, phrases, and sentences are the fundamental translation units in these systems [1], [2], and [3]. Most classic translation systems employ Bayesian inference to predict the likelihood of a pair of words being translated. One phrase represents the language of the source, while the second phase represents the language of the target. The frequency of words is exceedingly low, making it extremely difficult to pair and forecast the proper pair. Increased data set size was one of the most viable strategies for increasing the probability of a given pair of phrases. [4] As standard MTS are limited by their dependence on huge datasets, there is a need for alternate MT techniques.