Archive for the ‘Machine Learning’ Category

DIY Camera Uses Machine Learning to Audibly Tell You What it Sees – PetaPixel

Adafruit Industries has created a machine learning camera built with the Raspberry Pi that can identify objects extremely quickly and audibly tell you what it sees. The group has listed all the necessary parts you need to build the device at home.

The camera is based on Adafruits BrainCraft HAT add-on for the Raspberry Pi 4, and uses TensorFlow Lite object recognition software to be able to recognize what it is seeing. According to Adafruits website, its compatible with both the 8-megapixel Pi camera and the 12.3-megapixel interchangeable lens version of module.

While interesting on its own, DIY Photography makes a solid point by explaining a more practical use case for photographers:

You could connect a DSLR or mirrorless camera from its trigger port into the Pis GPIO pins, or even use a USB connection with something like gPhoto, to have it shoot a photo or start recording video when it detects a specific thing enter the frame.

A camera that is capable of recognizing what it is looking at could be used to only take a photo when a specific object, animal, or even a person comes into the frame. That would mean it could have security system or wildlife monitoring applications. Whenever you might wish your camera knew what it was looking at, this kind of technology would make that a reality.

You can find all the parts you will need to build your own version of this device on Adafruits website here. They also have published an easy machine learning guide for the Raspberry Pi as well as a guide on running TensorFlow Lite.

(via DPReview and DIY Photography)

Read more here:
DIY Camera Uses Machine Learning to Audibly Tell You What it Sees - PetaPixel

How machine learning was used to decode an ancient Chinese cave – Times of India

The name of the cell, a prosaic Cave 465, does not quite convey the cornucopia of imagery it contains angry Tantric deities in frenzied sexual union with their consorts. For decades, researchers have tried figuring out how old the Buddhist cave temple at the Mogao site along the ancient Silk Road in China is. Estimates range from the 9th century to the 14th. But now, the discovery of hidden Sanskrit inscriptions on pieces of paper stuck to its ceiling have helped narrow down its origins.On the edge of the Gobi desert, by the Dachuan river, the Mogao Caves have baffled researchers, who have settled on a thousand-year window for when all the 492 caves were carved out of cliffs, one at a long time, starting in the 4th century CE. Each cell, at first, appears isolated in its own history, linked to others through a grid of associations established by identifying pigments, painting styles or plain old radiocarbon dating. But Cave 465, to the north of the site, is unique.

Link:
How machine learning was used to decode an ancient Chinese cave - Times of India

SVG Tech Insight: Increasing Value of Sports Content Machine Learning for Up-Conversion HD to UHD – Sports Video Group

This fall SVG will be presenting a series of White Papers covering the latest advancements and trends in sports-production technology. The full series of SVGs Tech Insight White Papers can be found in the SVG Fall SportsTech Journal HERE.

Following the height of the 2020 global pandemic, live sports are starting to re-emerge worldwide albeit predominantly behind closed doors. For the majority of sports fans, video is the only way they can watch and engage with their favorite teams or players. This means the quality of the viewing experience itself has become even more critical.

With UHD being adopted by both households and broadcasters around the world, there is a marked expectation around visual quality. To realize these expectations in the immediate term, it will be necessary for some years to up-convert from HD to UHD when creating 4K UHD sports channels and content.

This is not so different from the early days of HD, where SD sporting related content had to be up-converted to HD. In the intervening years, however, machine learning as a technology has progressed sufficiently to be a serious contender for performing better up-conversions than with more conventional techniques, specifically designed to work for TV content.

Ideally, we want to process HD content into UHD with a simple black box arrangement.

The problem with conventional up-conversion, though, is that it does not offer an improved resolution, so does not fully meet the expectations of the viewer at home watching on a UHD TV. The question, therefore, becomes: can we do better for the sports fan? If so, how?

UHD is a progressive scan format, with the native TV formats being 38402160, known as 2160p59.64 (usually abbreviated to 2160p60) or 2160p50. The corresponding HD formats, with the frame/field rates set by region, are either progressive 1280720 (720p60 or 720p50) or interlaced 19201080 (1080i30 or 1080i25).

Conversion from HD to UHD for progressive images at the same rate is fairly simple. It can be achieved using spatial processing only. Traditionally, this might typically use a bi-cubic interpolation filter, (a 2-dimensional interpolation commonly used for photographic image scaling.) This uses a grid of 44 source pixels and interpolates intermediate locations in the center of the grid. The conversion from 1280720 to 38402160 requires a 3x scaling factor in each dimension and is almost the ideal case for an upsampling filter.

These types of filters can only interpolate, resulting in an image that is a better result than nearest-neighbor or bi-linear interpolation, but does not have the appearance of being a higher resolution.

Machine Learning (ML) is a technique whereby a neural network learns patterns from a set of training data. Images are large, and it becomes unfeasible to create neural networks that process this data as a complete set. So, a different structure is used for image processing, known as Convolutional Neural Networks (CNNs). CNNs are structured to extract features from the images by successively processing subsets from the source image and then processes the features rather than the raw pixels.

Up-conversion process with neural network processing

The inbuilt non-linearity, in combination with feature-based processing, mean CNNs can invent data not in the original image. In the case of up-conversion, we are interested in the ability to create plausible new content that was not present in the original image, but that doesnt modify the nature of the image too much. The CNN used to create the UHD data from the HD source is known as the Generator CNN.

When input source data needs to be propagated through the whole chain, possibly with scaling involved, then a specific variant of a CNN known as a Residual Network (ResNet) is used. A ResNet has a number of stages, each of which includes a contribution from a bypass path that carries the input data. For this study, a ResNet with scaling stages towards the end of the chain was used as the Generator CNN.

For the Generator CNN to do its job, it must be trained with a set of known data patches of reference images and a comparison is made between the output and the original. For training, the originals are a set of high-resolution UHD images, down-sampled to produce HD source images, then up-converted and finally compared to the originals.

The difference between the original and synthesized UHD images is calculated by the compare function with the error signal fed back to the Generator CNN. Progressively, the Generator CNN learns to create an image with features more similar to original UHD images.

The training process is dependent on the data set used for training, and the neural network tries to fit the characteristics seen during training onto the current image. This is intriguingly illustrated in Googles AI Blog [1], where a neural network presented with a random noise pattern introduces shapes like the ones used during training. It is important that a diverse, representative content set is used for training. Patches from about 800 different images were used for training during the process of MediaKinds research.

The compare function affects the way the Generator CNN learns to process the HD source data. It is easy to calculate a sum of absolute differences between original and synthesized. This causes an issue due to training set imbalance; in this case, the imbalance is that real pictures have large proportions with relatively little fine detail, so the data set is biased towards regenerating a result like that which is very similar to the use of a bicubic interpolation filter.

This doesnt really achieve the objective of creating plausible fine detail.

Generative Adversarial Neural Networks (GANs) are a relatively new concept [2], where a second neural network, known as the Discriminator CNN, is used and is itself trained during the training process of the Generator CNN. The Discriminator CNN learns to detect the difference between features that are characteristic of original UHD images and synthesized UHD images. During training, the Discriminator CNN sees either an original UHD image or a synthesized UHD image, with the detection correctness fed back to the discriminator and, if the image was a synthesized one, also fed back to the Generator CNN.

Each CNN is attempting to beat the other: the Generator by creating images that have characteristics more like originals, while the Discriminator becomes better at detecting synthesized images.

The result is the synthesis of feature details that are characteristic of original UHD images.

With a GAN approach, there is no real constraint to the ability of the Generator CNN to create new detail everywhere. This means the Generator CNN can create images that diverge from the original image in more general ways. A combination of both compare functions can offer a better balance, retaining the detail regeneration, but also limiting divergence. This produces results that are subjectively better than conventional up-conversion.

Conversion from 1080i60 to 2160p60 is necessarily more complex than from 720p60. Starting from 1080i, there are three basic approaches to up-conversion:

Training data is required here, which must come from 2160p video sequences. This enables a set of fields to be created, which are then downsampled, with each field coming from one frame in the original 2160p sequence, so the fields are not temporally co-located.

Surprisingly, results from field-based up-conversion tended to be better than using de-interlaced frame conversion, despite using sophisticated motion-compensated de-interlacing: the frame-based conversion being dominated by the artifacts from the de-interlacing process. However, it is clear that potentially useful data from the opposite fields did not contribute to the result, and the field-based approach missed data that could produce a better result.

A solution to this is to use multiple fields data as the source data directly into a modified Generator CNN, letting the GAN learn how best to perform the deinterlacing function. This approach was adopted and re-trained with a new set of video-based data, where adjacent fields were also provided.

This led to both high visual spatial resolution and good temporal stability. These are, of course, best viewed as a video sequence, however an example of one frame from a test sequence shows the comparison:

Comparison of a sample frame from different up-conversion techniques against original UHD

Up-conversion using a hybrid GAN with multiple fields was effective across a range of content, but is especially relevant for the visual sports experience to the consumer. This offers a realistic means by which content that has more of the appearance of UHD can be created from both progressive and interlaced HD source, which in turn can enable an improved experience for the fan at home when watching a sports UHD channel.

1 A. Mordvintsev, C. Olah and M. Tyka, Inceptionism: Going Deeper into Neural Networks, 2015. [Online]. Available: https://ai.googleblog.com/2015/06/inceptionism-going-deeper-into-neural.html

2 I. e. a. Goodfellow, Generative Adversarial Nets, Neural Information Processing Systems Proceedings, vol. 27, 2014.

More:
SVG Tech Insight: Increasing Value of Sports Content Machine Learning for Up-Conversion HD to UHD - Sports Video Group

This New Machine Learning Tool Might Stop Misinformation – Digital Information World

Misinformation has always been a problem, but the combination of widespread social media as well as a loose definition of what can be seen as factual truth in recent times has lead to a veritable explosion in misinformation over the course of the past few years. The problem is so dire that in a lot of cases websites are made specifically because of the fact that this is the sort of thing that could potentially end up allowing misinformation to spread more easily, and this is a problem that might just have been addressed by a new machine learning tool.

This machine learning tool was developed by researchers at UCL, Berkeley and Cornell will be able to detect domain registration data and use this to ascertain whether the URL is legitimate or if it has been made specifically to legitimize a certain piece of information that people might be trying to spread around. A couple of other factors also come into play here. For example, if the identity of the person that registered the domain is private, this might be a sign that the site is not legitimate. The timing of the domain registration matters to. If it was done around the time a major news event broke out, such as the recent US presidential election, this is also a negative sign.

With all of that having been said and out of the way, it is important to note that this new machine learning tool has a pretty impressive success rate of about 92%, which is the proportion of fake domains it was able to discover. Being able to tell whether or not a news source is legitimate or whether it is direct propaganda is useful because of the fact that it can help reduce the likelihood that people might just end up taking the misinformation seriously.

Go here to see the original:
This New Machine Learning Tool Might Stop Misinformation - Digital Information World

Machine learning removes bias from algorithms and the hiring process – PRNewswire

Arena Analytics' Chief Data Scientist unveils a cutting edge technique that removes latent bias from algorithmic models.

Currently, the primary methods of reducing the impact of bias on models has been limited to adjusting input data or adjust models after-the-fact to ensure there is no disparate impact.

Recent reporting from the Wall Street Journal confirmed these as the most recent advances, concluding, "It's really up to the software engineers and leaders of the company to figure out how to fix it [or] go into the algorithm and tweak some of the main factors it considers in making its decisions."

For several years, Arena Analytics was also limited to these approaches, but that all changed 9 months ago. Up until then, Arena removed all data from the models that could correlate to protected classifications and then measured demographic parity.

"These efforts brought us in line with EEOC compliance thresholds - also known as the or 80% rule," explains Myra Norton, President/COO of Arena. "But we've always wanted to go further than a compliance threshold.We've wanted to surface a MORE diverse slate of candidates for every role in a client organization.And that's exactly what we've accomplished, now surpassing 95% in our representation of different classifications."

Chief Data Scientist Patrick Hagerty will explain at MLConf the way he and his team have leveraged techniques known asadversarial networks,an aspect of Generative Adversarial Networks (GAN's), tools that pit one algorithm against another.

"Arena's primary model predicts the outcomes our clients want, and Model Two is a Discriminator designed to predict a classification," says Hagerty. "The Discriminator attempts to detect the race, gender, background, and any other protected class data of a person. This causes the Predictor to adjust and optimize while eliminating correlations with the classifications the Discriminator is detecting."

Arena trained models to do this until achieving what's known as the Nash Equilibrium. This is the point at which the predictor and discriminator have reached peak optimization.

Arena's technology has helped industrious individuals find a variety of jobs - from RNs to medtechs, caregivers to cooks, concierge to security. Job candidates who Arena predicted for success include veterans with no prior experience in healthcare or senior/assisted living, recent high school graduates whose plans to work while attending college were up-ended, and former hospitality sector employees who decided to apply their dining service expertise to a new setting.

"We succeeded in our intent to reduce bias and diversify the workforce, but what surprised us was the impact this approach had on our core predictions. Data once considered unusable, such as commuting distance, we can now analyze because we've removed the potentially-associated protected-class-signal," says Michael Rosenbaum, Arena's founder and CEO. "As a result, our predictions are stronger AND we surface a more diverse slate of candidates across multiple spectrums. Our clients can now use their talent acquisition function to really support and lead out front on Diversity and Inclusion."

About Arena (https://www.arena.io/) applies predictive analytics and machine learning to solve talent acquisition challenges. Learning algorithms analyze a large amount of data topredict with high levels of accuracy the likelihood of different outcomes occurring, such as someone leaving, being engaged, having excellent attendance, and more. By revealing each individual's likely outcomes in specific positions, departments, and locations, Arena is transforming the labor market from one based on perception and unconscious bias, to one based on outcomes. Arena is currently growing dramatically within the healthcare and hospitality industry and expanding its offerings to other people intensive industries. For more information contact [emailprotected]arena.io

SOURCE Arena

https://www.arena.io/

Read more here:
Machine learning removes bias from algorithms and the hiring process - PRNewswire