Analysis of Machine Learning and Computer Vision Emphasizing Unified Ant Colony Optimization

Abstract

Through this paper, we analyze specific branches of Artificial Intelligence (AI) technologies, such as Machine Learning (ML) and Computer Vision (CV). Specifically, we explore synergies between ML and CV, highlighting how Unified Ant Colony Optimization (ACO) serves as a unifying optimization technique in addressing complex problems within these domains. We also delve into the intricacies of ML and CV, examining both their theoretical foundations and practical applications, before exploring the Unified ACO algorithm and considering how it can be utilized to address difficult computational problems.

Introduction

§I Introduction

The Question of AI Consciousness

The prevalence of artificial intelligence technologies like Machine Learning (ML) have gained popularity in applied sciences, such as medicine and Computer Vision (CV). However, we pose the question of whether artificial intelligence has a consciousness and how it would change the state of the matter if it was proven that an AI was conscious, although it is not possible to prove. It is first possible to estimate that a network of neural nodes cannot produce a conscious mind that mimics the neurons or cognitive ability possessed by the human brain. We address this question for further investigation from a psychological point of view with the main focus on cognitive abilities of the artificial intelligence which cannot be achieved by simply simulating the neural network, regardless of computational power.

Artificial Intelligence (AI) and its consciousness is a general question of the modern Computer Science, which is highlighted in the press by many researchers [1, 2]. We seek to answer the question of whether it is important and whether it is possible. First, we give our argument towards the fact that psychology and self "I" of any mind cannot be followed from the chaos produced by the neural network. Thus, all the arguments made towards the fact that AI can be conscious based upon latest knowledge and technology cannot be addressed to the main point of view as per analogy of the human "I" and his or her modus operandi.

When considering the theoretical consciousness of AI, we turn our attention to psychological, psychical and physiological processes of the human brain. In these processes, signals are transferred through interconnected neurons between points. However, even when neural networks simulate this signal transmission, they still fail to sufficiently mimic the human brain to a point where we could consider it conscious. Additionally, all our discussion thus far has suggested that consciousness is merely cognitive in nature, which we do not believe to be the case. We can look at these limitations through psychiatry, likening it to what we can call a "Freud conjecture." This is to say that AI systems do not instinctively develop a sense of self of "I" in the way that humans are instinctively self-aware. As a result, even a computationally sufficient AI would not resemble consciousness.

Beyond AI consciousness, we give the definition of the algorithm of Explainable Machine Learning in order to define the extensible role of ML in the field of algorithmic approach rather than resorting to the classical approach of neural networks.

Economic and Practical Considerations

The data volume as a starting point of view are very large (hundreds of gigabytes of pure textual data) in order to train the neural network. This is a very challenging task as acquiring such a large amount of data and successful storage of the trained neural network necessitates investing in Big Data hardware. Thus, Big Data for AI poses economic challenges, since many cloud based services are offered on a "freemium" business model, which is not sustainable from an economic standpoint with high computational resource expenses.

Thus, beyond the theoretical possibility of creating an AI that is truly comparable to human consciousness, we find that it is not feasible (at least at present) for commercialization given the high costs of developing a sufficiently computationally advanced system. Additionally, we suggest that many AI and software engineers are disincentivized from creating an AI resembling consciousness, given that such an AI would be capable of replacing their job, leading to personal negative financial ramifications.

Further, we should consider whether there is considerable harm in creating an AI that resembles consciousness. It is possible to train the neural network against utilizing the prohibited type of data, and against allowing for users to submit harmful queries (in the case of LLMs). Thus, if we, for example, would train the network against the binary codes of executables and their source codes, this will lead to the reverse engineering which is a prohibited method of obtaining source codes of programs from their packaged content. The same malicious method can be applied to password brute force attacks and other tasks which involve the example to be selected from a large amount of sample data.

ChatGPT and other modern tools utilizing artificial intelligence are already beginning to implement the use of automated assistants in business processes. This still poses economic considerations; however, due to the existing capabilities of the automated solutions, there's no need for human oversight. Thus, the economic benefit or harm of proficient artificial intelligence systems remains an open question in the global labor market.

Swarm Optimization

§II Unified Ant Colony Optimization (ACO) / Swarm Optimization

Swarm optimization is a common term for ACO, which is simpler to implement. Our key result is to represent the unified form of this algorithm for further framework-like implementation, which is already done in the "AntTSP" software package.

The ACO Algorithm

The unified ACO can be represented by the following starting operation for two matrices. The first is a pheromone matrix (eta), defined as the inverse of the input data matrix:

eta[i,j] = 1.0 / input[i,j] (1)

Where "eta" is the pheromone matrix and "input" is a matrix of the input data which are to be defined before modeling ACO for the optimization problem solution. The second matrix, which is also to be defined, is:

tau[i,j] = 1.0 (2)

This is a compound matrix which defines the operation of each simulated agent, like artificial ants. During each iteration, we define the next candidate to be chosen according to the following optimization function:

v = max { tau[s,i] * eta[s,i] } (3)

When the candidate v is chosen, the matrix is updated according to the following rule:

tau[s,t] = tau[s,t] * rho + Q / len (4)

Where "rho" (between 0 and 1) and "Q" (between 0 and 100) are arbitrary parameters and "len" is the optimization function along the computed path of the selected candidates. The equations (1)–(4) are important to learn as they represent the inner output of the ACO algorithm in the main iteration, which can be fixed or infinite and usually converges after a pre-defined number of steps equal to the size of the output.

NP-Hard and NP-Complete Problems

The universal approach of solving hard problems without quantum computing or other means is the general acceptance method of the AI and solution of the so-called NP-hard or NP-complete problems, which are also closed under EXPSPACE and EXPTIME. Marco Dorigo's swarm optimization is an artificial intelligence methodology in its unified variant, giving the opportunity to solve NP-complete problems in a reasonable amount of time and quadratic space.

Explainable ML

§III Explainable Machine Learning

After the overview of the research on the latest publications for Machine Learning and its application benefits and shortcomings, we present an algorithmically practical solution for ML, known as Explainable Machine Learning, or simply EML. EML relies on input data and can be represented as the matrix of sorted elements by the category and priority at each row and column. The steps of building this matrix and evaluation of the query are defined as follows:

Sort the input data matrix with given priority of each factor;
Query the result for each data in the sorted matrix for the short-coming range of lower and upper row;
For the number values do the floor or ceil of the value and compare it same way as the word entities in the input matrix.

Neural networks, which are based upon sigmoid functions, can be extended for EML as the algorithm for arbitrary function with respect to the term various probability type, which is well-known in modeling theory of mass-servicing queues. The algorithm for an arbitrary function f(x) is as follows:

Define the function f(x) and its range L and R;
Compute the minimum and maximum values of function f(x) on this range using differential calculus;
Train the neural network by applying function f(x) divided by the minimum and maximum, with subtraction of the minimum.

The steps above give the novel example of arbitrary learning without usage of sigmoid-function for neural network training.

EML Advantages

Explainable Machine Learning does not require the neural network to be trained, reducing the time of development significantly. The EML decision tree can be expanded by online query, and it operates in linear time, with a query performance time of O(n), where n represents the number of factors in the data matrix of the input. This is a meaningful practical advantage over traditional neural network approaches, which require significant upfront training time.

Computer Vision

§IV Computer Vision

At present, the most well-known algorithms are for pattern string matching, i.e., the problem of finding the occurrences of a pattern in a matching string. Examples would include Knuth-Morris-Pratt and Boyer-Moore, both of which were adapted for digital patterns when patterns can be taken from a scanned image. We present a linear error-prune algorithm for OCR text matching.

The matching cost can be computed using a variety of techniques, such as Hamming distance. For experimental purposes, the Tesseract OCR engine was used. However, the Hamming distance is good for low-cost computing, while the approach described in this paper can be used for large data. To handle large data threads, we use the windowing technique. Under this technique, the necessary data for a pre-defined time interval can be obtained as:

Window [1….t] = Data × Time[1..t] (5)

From the obtained data, the penalty can be computed. This penalty is the measure of equivalence between searching text and scrambled data, possibly by an OCR engine. Thus, the following holds true:

Penalty [Data [1…t]] == A ~ B (6)

Where 'A' is the searching text and 'B' is scrambled data.

Naive Approach to OCR Text Matching

The naive approach is simple, as it only requires calculating the distance between two words. This method has practical applications when the number of symbols (or the weight of word) in text to be searched is small. By this method we get the magnitude equal to the weight of the text. To compute this distance, a bi-linear search is executed at every position in the scrambled text. The cost is thus a cumulative sum of penalties at every step when symbols coincide:

Cost (A, B) = SUM { 1.0 / Distance[X[1], X[2]] } (7)

Where A, B are words and X[1], X[2] are positions of two consecutive matchings. It should be noted that the algorithm experimentally produces good results for low-cost pattern matching. Basic Java code can be utilized for this purpose.

Scalable Window Approach

The scalable window approach opposite to the naive method uses a flowing window event when mismatching occurs, in other words, it slowly checks for occurrences of a match in the appearing window. Thus, the possible result of excess symbols in scrambled text is reduced. The sub-algorithm decides what symbol will be the gap's left border, while the right border lies on the mismatched position [Y[1], Y[2]] (in pattern and scrambled text). The algorithm uses input parameters for the windowing algorithm, and this set can be extended. The final reduction, which requires minimal steps or data manipulation, leads to the exact expected text extraction.

§V Conclusion

We have presented the modern results on Machine Learning, which is named in this article as Explainable Machine Learning. We presented it as it is more relevant and does not require the neural network to be trained, reducing the time of development significantly. This is because the EML decision tree can be expanded by online query, and it operates in linear time, with a query performance time of O(n) where n represents the number of factors in the data matrix of the input.

We have also concluded that Marco Dorigo's swarm optimization is an artificial intelligence methodology in its unified variant, giving the opportunity to solve NP-complete problems in a reasonable amount of time and quadratic space. The question of artificial intelligence and its consciousness, as well as the global impact of it on economics and the global labor market, are questions which we pose, but acknowledge that further research and data are needed on.

References

Coller, Kevin. "What is consciousness? ChatGPT and advanced AI might redefine our answer" // https://www.nbcnews.com/tech/tech-news/chatgpt-ai-consciousness-rcna71777 (accessed 3/30/2023).
Hildt, Elisabeth "Artificial Intelligence: Does Consciousness Matter?" // https://www.frontiersin.org/articles/10.3389/fpsyg.2019.01535/full (accessed 3/30/2023).
Jordan, Michael I., and Tom M. Mitchell. "Machine learning: Trends, perspectives, and prospects." Science 349.6245 (2015): 255-260.
Mahesh, Batta. "Machine learning algorithms-a review." International Journal of Science and Research (IJSR). 9 (2020): 381-386.
Athey, Susan. "The impact of machine learning on economics." The economics of artificial intelligence: An agenda. University of Chicago Press, 2018. 507-547.
Chen, Mingzhe, et al. "Artificial neural networks-based machine learning for wireless networks: A tutorial." IEEE Communications Surveys & Tutorials 21.4 (2019): 3039-3071.

Discuss this research.

Interested in optimization algorithms, Explainable Machine Learning, or AI applications in computer vision?

Get in Touch