Archive for the ‘Machine Learning’ Category

Want to Land Your Dream ML Job at Tesla? Here’s How? – Analytics Insight

Now landing your favorite ML job at Tesla might be closer than you think! Read to learn more.

If working at Tesla is one of your dream jobs then heres your chance. Tesla is hiring artificial intelligence and machine learning engineers and if you think you have done exceptional work in software, hardware, or AI then you may directly apply with your CV by visiting the official Tesla website. While you may be wondering what qualifies to be called exceptional work then the CEO, Elon Musk himself has provided a tip. In one of his tweets, Musk said, As always, Tesla is looking for hardcore AI engineers who care about solving problems that directly affect peoples lives in a major way.

Tesla has scope for machine learning engineers who are interested in working in developing Full Self-Driving (FSD) chips, Dojo systems, Neural Networks, Autonomy Algorithms, and coding. If given a chance you may be part of the Tesla Bot project as well. In another tweet, Elon Musk clarified that he is fine with hiring people without AI background too. A background in AI is not needed, just exceptional skill in software or computer design, he said. On the other hand, the Tesla website job posting has described that the company is looking for mechanical, electrical, controls, and software engineers to help leverage its AI expertise beyond the vehicle fleet.

For hardcore coders, Tesla wants engineers who can build Autopilot software foundations up from the lowest levels of the stack, tightly integrating with Teslas custom hardware. The company aims to Implement super-reliable bootloaders with support for over-the-air updates and bring up customized Linux kernels. It wants the ML engineers to write fast, memory-efficient low-level code to capture high-frequency, high-volume data from the sensors, and to share it with multiple consumer processes without impacting central memory access latency or starving critical functional code from CPU cycles.

Another special job related to machine learning in Tesla is Autopilot Simulation Deep Learning Engineers. Autopilot is of critical importance to Teslas mission. It is safer, makes driving more enjoyable, and will ultimately deliver on the promise of self-driving cars. As a member of Teslas Autopilot Simulation team, you will be in a unique position to accelerate the pace at which Autopilot improves over time. The Simulation team realizes these goals through generating synthetic datasets for neural network training, building tools that enable Autopilot software developers to perform virtual test drives instead of real ones, and through testing all Autopilot code changes and software releases for regressive behavior.

As an Autopilot Simulation Deep Learning Engineer, you will contribute to the development of Autopilot simulation by enabling and accelerating the creation of photorealistic 3D scenes through neural rendering, neural animation, and scene/object reconstruction. You will be at the cutting edge of deep learning applications and will train neural networks on a cluster in large-scale distributed settings as Tesla has one of the biggest training clusters in the world. You will have tremendous reach and autonomy, with an exciting opportunity to impact Teslas goal of full self-driving and its mission to accelerate the worlds transition to sustainable energy. As the machine learning engineer, you will be a creative self-starter with an ability to self-assign projects across different areas of the Autopilot team. You will also have the opportunity to identify gaps between real and simulated environments, and then you will get to close those gaps through state-of-the-art machine learning/deep learning methods.

Share This ArticleDo the sharing thingy

See more here:
Want to Land Your Dream ML Job at Tesla? Here's How? - Analytics Insight

Meet the winners of the Machine Learning Hackathon by Swiss Re & MachineHack – Analytics India Magazine

Swiss Re, in collaboration with MachineHack, successfully completed the Machine Learning Hackathon held from March 11th to 28th for data scientists and ML professionals to predict accident risk scores for unique postcodes. The end goal? To build a machine learning model to improve auto insurance pricing.

The hackathon saw over 1100+ registrations and 300+ participants from interested candidates. Out of those, the top five were asked to participate in a solution showcase held on the 6th of April. The top five entries were judged by Amit Kalra, Managing Director, Swiss Re and Jerry Gupta, Senior Vice President, Swiss Re who engaged with the top participants, understood their solutions and presentations and provided their comments and scores. From that emerged the top three winners!

Lets take a look at the winners who impressed the judges with their analytics skills and took home highly coveted cash prizes and goodies.

Pednekar comes with over 19 years of work experience in IT, project management, software development, application support, software system design, and requirement study. He is passionate about new technologies, especially data science, AI and machine learning.

My expertise lies in creating data visualisations to tell my datas story & using feature engineering to add new features to give a human touch in the world of machine learning algorithms, said Pednekar.

Pednekars approach consisted of seven steps:

For EDA, Pednekar has analysed the dataset to find out the relationship between:

Image: Rahul Pednekar

Image: Rahul Pednekar

Here, Pednekar merged Population & Road Network datasets with train using left join. He created Latitude and Longitude columns by extracting data from the WKT columns in Roads_network.

He proceeded to

And added new features:

Pednekar completed the following steps:

Image: Rahul Pednekar

Image: Rahul Pednekar

Pednekar has thoroughly enjoyed participating in this hackathon. He said, MachineHack team and the platform is amazing, and I would like to highly recommend the same to all data science practitioners. I would like to thank Machinehack for providing me with the opportunity to participate in various data science problem-solving challenges.

Check the code here.

Yadavs data science journey started a couple of years back, and since then, he has been an active participant in hackathons conducted on different platforms. Learning from fellow competitors and absorbing their ideas is the best part of any data science competition as it just widens the thinking scope for yourself and makes you better after each and every competition, says Yadav.

MachineHack competitions are unique and have a different business case in each of their hackathons. It gives a field wherein we can practice and learn new skills by applying them to a particular domain case. It builds confidence as to what would work and what would not in certain cases. I appreciate the hard work the team is putting in to host such competitions, adds Yadav.

Check the code here.

Rank 03: Prudhvi Badri

Badri entered the data science field while pursuing a masters in computer science at Utah State University in 2014 and had taken classes related to statistics, Python programming and AI, and wrote a research paper to predict malicious users in online social networks.

After my education, I started to work as a data scientist for a fintech startup company and built models to predict loan default risk for customers. I am currently working as a senior data scientist for a website security company. In my role, I focus on building ML models to predict malicious internet traffic and block attacks on websites. I also mentor data scientists and help them build cool projects in this field, said Badri.

Badri mainly focused on feature engineering to solve this problem. He created aggregated features such as min, max, median, sum, etc., by grouping a few categorical columns such as Day_of_Week, Road_Type, etc. He built features from population data such as sex_ratio, male_ratio, female_ratio, etc.

He adds, I have not used the roads dataset that has been provided as supplemental data. I created a total of 241 features and used ten-fold cross-validation to validate the model. Finally, for modelling, I used a weighted ensemble model of LightGBM and XGBoost.

Badri has been a member of MachineHack since 2020. I am excited to participate in the competitions as they are unique and always help me learn about a new domain and let me try new approaches. I appreciate the transparency of the platform sharing the approaches of the top participants once the hackathon is finished. I learned a lot of new techniques and approaches from other members. I look forward to participating in more hackathons in the future on the MachineHack platform and encourage my friends and colleagues to participate too, concluded Badri.

Check the code here.

The Swiss Re Machine Learning Hackathon, in collaboration with MachineHack, ended with a bang, with participants presenting out-of-the-box solutions to solve the problem in front of them. Such a high display of skills made the hackathon intensely competitive and fun and surely made the challenge a huge success!

View post:
Meet the winners of the Machine Learning Hackathon by Swiss Re & MachineHack - Analytics India Magazine

IBM And MLCommons Show How Pervasive Machine Learning Has Become – Forbes

AI, Artificial Intelligence concept,3d rendering,conceptual image.

This week IBM announced its latest Z-series mainframe and MLCommons released its latest benchmark series. The two announcements had something in common Machine Learning (ML) acceleration which is becoming pervasive everywhere from financial fraud detection in mainframes to detecting wake words in home appliances.

While these two announcements were not directly related, but they are part of a trend, showing how pervasive ML has become.

MLCommons Brings Standards to ML Benchmarking

ML benchmarking is important because we often hear about ML performance in terms of TOPS trillions of operations per second. Like MIPS (Millions of Instructions per Second or Meaningless Indication of Processor Speed depending on your perspective), TOPS is a theoretical number calculated from the architecture, not a measured rating based on running workloads. As such, TOPS can be a deceiving number because it does not include the impact of the software stack., Software is the most critical aspect of implementing ML and the efficiency varies widely, which Nvidia clearly demonstrated by improving the performance of its A100 platform by 50% in MLCommons benchmarks over the years.

The industry organization MLCommons was created by a consortium of companies to build a standardized set of benchmarks along with a standardized test methodology that allows different machine learning systems to be compared. The MLPerf benchmark suites from MLCommons include different benchmarks that cover many popular ML workloads and scenarios. The MLPerf benchmarks addresses everything from the tiny microcontrollers used in consumer and IoT devices, to mobile devices like smartphones and PCs, to edge servers, to data center-class server configuration. Supporters of MLCommons include Amazon, Arm, Baidu, Dell Technologies, Facebook, Google, Harvard, Intel, Lenovo, Microsoft, Nvidia, Stanford and the University of Toronto.

MLCommons releases benchmark results in batches and has different publishing schedules for inference and for training. The latest announcement was for version 2.0 of the MLPerf Inference suite for data center and edge servers, version 2.0 for MLPerf Mobile, and version 0.7 for MLPerf Tiny for IoT devices.

To date, the company that has had the most consistent set of submissions, producing results every iteration, in every benchmark test, and by multiple partners, has been Nvidia. Nvidia and its partners appear to have invested enormous resources in running and publishing every relevant MLCommons benchmark. No other vendor can match that claim. The recent batch of inference benchmark submissions include Nvidia Jetson Orin SoCs for edge servers and the Ampere-based A100 GPUs for data centers. Nvidias Hopper H100 data center GPU, which was announced at Spring 2022 GTC, arrived too late to be included in the latest MLCommons announcement, but we fully expect to see Nvidia H100 results in the next round.

Recently, Qualcomm and its partners have been posting more data center MLPerf benchmarks for the companys Cloud AI 100 platform and more mobile MLPerf benchmarks for Snapdragon processors. Qualcomms latest silicon has proved to be very power efficient in data center ML tests, which may give it an edge on power-constrained edge server applications.

Many of the submitters are system vendors using processors and accelerators from silicon vendors like AMD, Andes, Ampere, Intel, Nvidia, Qualcomm, and Samsung. But many of the AI startups have been absent. As one consulting company, Krai, put it: Potential submitters, especially ML hardware startups, are understandably wary of committing precious engineering resources to optimizing industry benchmarks instead of actual customer workloads. But then Krai countered their own objection with MLPerf is the Olympics of ML optimization and benchmarking. Still, many startups have not invested in producing MLCommons results for various reasons and that is disappointing. Theres also not enough FPGA vendors participating in this round.

The MLPerf Tiny benchmark is designed for very low power applications such as keyword spotting, visual wake words, image classification, and anomaly detection. In this case we see results from a mix of small companies like Andes, Plumeria, and Syntiant, as well as established companies like Alibaba, Renesas, Silicon Labs, and STMicroeletronics.

IBM z16 Mainframe

IBM Adds AI Acceleration Into Every Transaction

While IBM didnt participate in MLCommons benchmarks, the company takes ML seriously. With its latest Z-Series mainframe computer, the z16, IBM has added accelerators for ML inference and quantum-safe secure boot and cryptography. But mainframe systems have different customer requirements. With roughly 70% of banking transactions (on a value basis) running on IBM mainframes, the company is anticipating the needs of financial institutes for extreme reliable and transaction processing protection. In addition, by adding ML acceleration into its CPU, IBM can offer per-transaction ML intelligence to help detect fraudulent transactions.

In an article I wrote in 2018, I said: In fact, the future hybrid cloud compute model will likely include classic computing, AI processing, and quantum computing. When it comes to understanding all three of those technologies, few companies can match IBMs level of commitment and expertise. And the latest developments in IBMs quantum computing roadmap and the ML acceleration in the z16, show IBM is a leader in both.

Summary

Machine Learning is important from tiny devices up to mainframe computers. Accelerating this workload can be done on CPUs, GPUs, FPGAs, ASICs, and even MCUs and is now a part of all computing going forward. These are two examples of how ML is changing and improving over time.

Tirias Research tracks and consults for companies throughout the electronics ecosystem from semiconductors to systems and sensors to the cloud. Members of the Tirias Research team have consulted for IBM, Nvidia, Qualcomm, and other companies throughout the AI ecosystems.

Read the original:
IBM And MLCommons Show How Pervasive Machine Learning Has Become - Forbes

Prognostics of unsupported railway sleepers and their severity diagnostics using machine learning | Scientific Reports – Nature.com

Unsupported sleeper detection

From the machine model development for detecting unsupported sleepers, the accuracy of each model is shown in Table 4.

From the table, it can be seen that each model performs well. The accuracy of each model is higher than 90% when the data processing is appropriate. CNN performs the best based on its accuracies. When CNN is applied with FFT and padding, the accuracies are the first and second highest compared to other models. For RNN and ResNet, the accuracies are higher than 90% when specific data processing is used. However, the accuracies become 80% approximately when another data processing technique is used. For FCN, data processing is not needed. The FCN model can achieve an accuracy of 95%. From the table, the models with the highest accuracy are CNN, RNN, FCN, and ResNet respectively. The complicated architecture of ResNet does not guarantee the highest accuracy. Moreover, the training time of ResNet (46s/epoch) is the longest followed by RNN (6s/epoch), FCN (2s/epoch), and CNN (1s/epoch) respectively. It can be concluded that the CNN model is the best model to detect supported sleepers in this study because it provides the highest accuracy or 100% while the training time is the lowest. At the same time, easy data processing likes padding is good enough to provide a good result. It is better than FFT in the CNN model which requires longer data processing. The accuracy of testing data of each model is shown in Fig.8.

Accuracies of testing data on unsupported sleeper detection.

The tuned hyperparameters of the CNN model with padding data are shown in Table 5.

Compared to the previous study, Sysyn et al.1 applied statistical methods and KNN which provided the best detection accuracy of 65%. The accuracy of the CNN model developed in this study is significantly higher. It can be assumed that the machine learning techniques used in this study are more powerful than the ones used in the previous study. Moreover, CNN is proven that it is suitable for pattern recognition.

For the unsupported sleeper severity classification, the performance of each model is shown in Table 6.

From the table, it can be seen that the CNN model still performs the best with an accuracy of 92.89% and provides good results with both data processing. However, the accuracies of RNN and ResNet significantly drop when unsuitable data processing is conducted. For example, the accuracy of the RNN model with padding drops to 33.89%. The best performance that RNN can achieve is 71.56% which is the lowest compared to other models. This is because of the limitation of RNN that vanishing gradient occurs when time-series data is too long. In this study, the number of data points for padding data is 1181 which can result in the issue. Therefore, RNN does not perform well. ResNet performs well with an accuracy of 92.42% close to CNN while the accuracy of FCN is fairly well. For the training time, CNN is the fastest model with the training time of 1s/epoch followed by FCN (2s/epoch), RNN (5s/epoch), and ResNet (32s/epoch) respectively. From these, it can be concluded that the CNN model is the best model for unsupported sleeper severity classification in this study. Moreover, it can be concluded that CNN and ResNet are suitable with padding data while RNN is suitable with FFT data. The accuracy of testing data of each model is shown in Fig.9.

Accuracies of testing data on unsupported sleeper severity classification.

The confusion matrix of the CNN model is shown in Table 7.

To clearly demonstrate the performance of each model, precision and recall are shown in Table 8.

From the table, the precisions and recalls of CNN and ResNet are fairly good with values higher than 80% while RNN is the worst. Some precisions of RNN are lower than 60% which cannot be used in realistic situations. CNN seems to be the better model than ResNet because all precisions are higher than 90%. Although some precisions of ResNet are higher than CNN, the precision of class 2 is about 80%. Therefore, the use of the CNN model is better.

For hyperparameter tuning, the tuned hyperparameters of CNN are shown in Table 9.

Originally posted here:
Prognostics of unsupported railway sleepers and their severity diagnostics using machine learning | Scientific Reports - Nature.com

Machine learning in higher education – McKinsey

Many higher-education institutions are now using data and analytics as an integral part of their processes. Whether the goal is to identify and better support pain points in the student journey, more efficiently allocate resources, or improve student and faculty experience, institutions are seeing the benefits of data-backed solutions.

Those at the forefront of this trend are focusing on harnessing analytics to increase program personalization and flexibility, as well as to improve retention by identifying students at risk of dropping out and reaching out proactively with tailored interventions. Indeed, data science and machine learning may unlock significant value for universities by ensuring resources are targeted toward the highest-impact opportunities to improve access for more students, as well as student engagement and satisfaction.

For example, Western Governors University in Utah is using predictive modeling to improve retention by identifying at-risk students and developing early-intervention programs. Initial efforts raised the graduation rate for the universitys four-year undergraduate program by five percentage points between 2018 and 2020.

Yet higher education is still in the early stages of data capability building. With universities facing many challenges (such as financial pressures, the demographic cliff, and an uptick in student mental-health issues) and a variety of opportunities (including reaching adult learners and scaling online learning), expanding use of advanced analytics and machine learning may prove beneficial.

Below, we share some of the most promising use cases for advanced analytics in higher education to show how universities are capitalizing on those opportunities to overcome current challenges, both enabling access for many more students and improving the student experience.

Data science and machine learning may unlock significant value for universities by ensuring resources are targeted toward the highest-impact opportunities to improve access for more students, as well as student engagement and satisfaction.

Advanced-analytics techniques may help institutions unlock significantly deeper insights into their student populations and identify more nuanced risks than they could achieve through descriptive and diagnostic analytics, which rely on linear, rule-based approaches (Exhibit 1).

Exhibit 1

Advanced analyticswhich uses the power of algorithms such as gradient boosting and random forestmay also help institutions address inadvertent biases in their existing methods of identifying at-risk students and proactively design tailored interventions to mitigate the majority of identified risks.

For instance, institutions using linear, rule-based approaches look at indicators such as low grades and poor attendance to identify students at risk of dropping out; institutions then reach out to these students and launch initiatives to better support them. While such initiatives may be of use, they often are implemented too late and only target a subset of the at-risk population. This approach could be a good makeshift solution for two problems facing student success leaders at universities. First, there are too many variables that could be analyzed to indicate risk of attrition (such as academic, financial, and mental health factors, and sense of belonging on campus). Second, while its easy to identify notable variance on any one or two variables, it is challenging to identify nominal variance on multiple variables. Linear, rule-based approaches therefore may fail to identify students who, for instance, may have decent grades and above-average attendance but who have been struggling to submit their assignments on time or have consistently had difficulty paying their bills (Exhibit 2).

Exhibit 2

A machine-learning model could address both of the challenges described above. Such a model looks at ten years of data to identify factors that could help a university make an early determination of a students risk of attrition. For example, did the student change payment methods on the university portal? How close to the due date does the student submit assignments? Once the institution has identified students at risk, it can proactively deploy interventions to retain them.

Though many institutions recognize the promise of analytics for personalizing communications with students, increasing retention rates, and improving student experience and engagement, institutions could be using these approaches for the full range of use cases across the student journeyfor prospective, current, and former students alike.

For instance, advanced analytics can help institutions identify which high schools, zip codes, and counties they should focus on to reach prospective students who are most likely to be great fits for the institution. Machine learning could also help identify interventions and support that should be made available to different archetypes of enrolled students to help measure and increase student satisfaction. These use cases could then be extended to providing students support with developing their skills beyond graduation, enabling institutions to provide continual learning opportunities and to better engage alumni. As an institution expands its application and coverage of advanced-analytics tools across the student life cycle, the model gets better at identifying patterns, and the institution can take increasingly granular interventions and actions.

Institutions will likely want to adopt a multistep model to harness machine learning to better serve students. For example, for efforts aimed at improving student completion and graduation rates, the following five-step technique could generate immense value:

Institutions could deploy this model at a regular cadence to identify students who would most benefit from additional support.

Institutions could also create similar models to address other strategic goals or challenges, including lead generation and enrollment. For example, institutions could, as a first step, analyze 100 or more attributes from years of historical data to understand the characteristics of applicants who are most likely to enroll.

Institutions will likely want to adopt a multistep model to harness machine learning to better serve students.

The experiences of two higher education institutions that leaned on advanced analytics to improve enrollment and retention reveal the impact such efforts can have.

One private nonprofit university had recently enrolled its largest freshman class in history and was looking to increase its enrollment again. The institution wanted to both reach more prospective first-year undergraduate students who would be a great fit for the institution and improve conversion in the enrollment journey in a way that was manageable for the enrollment team without significantly increasing investment and resources. The university took three important actions:

For this institution, advanced-analytics modeling had immediate implications and impact. The initiative also suggested future opportunities for the university to serve more freshmen with greater marketing efficiency. When initially tested against leads for the subsequent fall (prior to the application deadline), the model accurately predicted 85 percent of candidates who submitted an application, and it predicted the 35 percent of applicants at that point in the cycle who were most likely to enroll, assuming no changes to admissions criteria (Exhibit 3). The enrollment management team is now able to better prioritize its resources and time on high-potential leads and applicants to yield a sizable class. These new capabilities will give the institution the flexibility to make strategic choices; rather than focus primarily on the size of the incoming class, it may ensure the desired class size while prioritizing other objectives, such as class mix, financial-aid allocation, or budget savings.

Exhibit 3

Similar to many higher-education institutions during the pandemic, one online university was facing a significant downward trend in student retention. The university explored multiple options and deployed initiatives spearheaded by both academic and administrative departments, including focus groups and nudge campaigns, but the results fell short of expectations.

The institution wanted to set a high bar for student success and achieve marked and sustainable improvements to retention. It turned to an advanced-analytics approach to pursue its bold aspirations.

To build a machine-learning model that would allow the university to identify students at risk of attrition early, it first analyzed ten years of historical data to understand key characteristics that differentiate students who were most likely to continueand thus graduatecompared with those who unenrolled. After validating that the initial model was multiple times more effective at predicting retention than the baseline, the institution refined the model and applied it to the current student population. This attrition model yielded five at-risk student archetypes, three of which were counterintuitive to conventional wisdom about what typical at-risk student profiles look like (Exhibit 4).

Exhibit 4

Together, these three counterintuitive archetypes of at-risk studentswhich would have been omitted using a linear analytics approachaccount for about 70 percent of the students most likely to discontinue enrollment. The largest group of at-risk individuals (accounting for about 40 percent of the at-risk students identified) were distinctive academic achievers with an excellent overall track record. This means the model identified at least twice as many students at risk of attrition than models based on linear rules. The model outputs have allowed the university to identify students at risk of attrition more effectively and strategically invest in short- and medium-term initiatives most likely to drive retention improvement.

With the model and data on at-risk student profiles in hand, the online university launched a set of targeted interventions focused on providing tailored support to students in each archetype to increase retention. Actions included scheduling more touchpoints with academic and career advisers, expanding faculty mentorship, and creating alternative pathways for students to satisfy their knowledge gaps.

Advanced analytics is a powerful tool that may help higher-education institutions overcome the challenges facing them today, spur growth, and better support students. However, machine learning is complex, with considerable associated risks. While the risks vary based on the institution and the data included in the model, higher-education institutions may wish to take the following steps when using these tools:

While many higher-education institutions have started down the path to harnessing data and analytics, there is still a long way to go to realizing the full potential of these capabilities in terms of the student experience. The influx of students and institutions that have been engaged in online learning and using technology tools over the past two years means there is significantly more data to work with than ever before; higher-education institutions may want to start using it to serve students better in the years to come.

Here is the original post:
Machine learning in higher education - McKinsey