In the early 1960s, AI pioneer Herbert Simon observed that in a span of two decades, machines will match the cognitive abilities of humankind. Predictions like these motivated theorists, sceptics and thinkers from a cross-section of domains to find ways to use computers to perform routine tasks. From Heron’s Automatons in the first century to Google’s Deep Mind in the 21st century, mankind has yearned to make machines more ‘human’.
General Artificial Intelligence
The latest developments in AI, especially in the applications of Generative Adversarial Networks (GANs), can help researchers tackle the final frontier for replicating human intelligence. With a new paper being released every week, GANs are proving to be a front-runner for achieving the ultimate — General AI (AGI).
With the publication of this paper in 2014, applications of GANs have witnessed a tremendous growth. A noteworthy point is that there is a spike in the number of papers on GANs which is relatively higher than on contemporary deep learning architectures.
GANs can help machines plug in the last piece in the puzzle that is General AI. For example, GANs are now capable of creating realistic images out of the noise.
Do Whatever You GAN
GANs have advanced to a point where they can pick up trivial expressions denoting significant human emotions. For example, the notorious Deep Fakes deployed GAN architecture and reconstructed faces and torsos in three-dimensional space.
Here is an example where you can see how the network takes the input involving different sexes and gives a gender-specific output.
Celebrated computer scientist Yann Lecun observed, “GANs and the variations that are now being proposed is the most interesting idea in the last 10 years in ML, in my opinion.” Here are few applications which stand as a testimony to Lecun’s prognosis:
- WaveGANs can synthesise audio outputs which are previously thought to be invertible. It boasts of creating a coherent intelligent speech in an unsupervised setting. An analogy can be drawn from image-generating GANs to generate video. It could further be trained on an RNN to generate plausible sequences of latent vectors.
- InfoGAN manages to disintegrate representations and maximise the mutual information gained from latent variables. It marvels at differentiating between writing styles and pose from the lighting of 3D rendered images.
- While existing models can synthesise images based on global constraints such as a class label or caption, they do not provide control over pose or object location. Generative Adversarial What-Where Network (GAWWN), synthesises images describing what content to draw in which location.
- Projective generative adversarial networks(PrGANs) trains a deep generative model of 3D shapes whose projections match the distributions of the input 2D views. It allows us to predict 3D, viewpoint, and generate novel views from an input image in a completely unsupervised manner.
- Face Conditional Generative Adversarial Network(FCGAN) can generate high-resolution facial images from little to no prior information. Skip layers are used extensively in the sub-networks to improve the convergence speed. This can be used in curbing unwanted activities in urban areas and improving surveillance.
- A simple architecture, the Location-Aware Generative Adversarial Network (LAGAN), learns to produce realistic radiation patterns from simulated high energy particle collisions. This is a phenomenal achievement in linking generative machine learning and particle physics to produce 2D representations of energy depositions from particles interactions.
GANs can even learn to synthesise speech correlated to intricate human expressions, corresponding head pose or eye gaze. This further can be applied to diagnose depression or other mental disorders in its infancy. The meteoric rise in the popularity of GANs has been accompanied by its equally increasing capability to reach domains which have been immune to AI.
Variable auto-encoders can tweak in some of GANs’ powers (AAE) and help produce high-resolution images for research in molecular biology which further can be used for monitoring molecules of cancerous cells.
GANS — the real breakthrough in AI
GANs have made considerable noise in research circles. In the case of GANs, many parallels can be drawn from past networks which have been durable to sustain competition and flexible enough to integrate with new methodologies. These networks facilitate unforeseen hybridisation and the ease with which they merge pre-existing models makes them robust and undeniable.
From emoting super-realistic expressions to exploring deep space, from bridging the human-machine empathetic disconnect to introducing new art forms, GANs have it all covered. If AI research can be likened to probing a dark room then GANs might be the switch we have been looking for.
Try deep learning using MATLAB