HPC and big data convergence in fraud detection


The convergence of two existing disciplines can be an explosively creative force. A great example in the world of tech is the convergence of HPC with big data and machine learning. Although this convergence is in many ways still in its early stages, it is already delivering concrete, real-world benefits in fraud detection, helping financial firms save hundreds of millions of dollars.

Unsurprisingly, PayPal is one of the companies at the forefront of this convergence. As an online transaction processor that was conceived on the Internet, PayPal has grown up exposed to virtually every cyber security threat and type of fraud imaginable. Because of this, the company has aggressively pursued a security strategy that combines HPC and big data technologies since as early as 2001. Although PayPal keeps the details of its fraud protection systems secret, it has been very open about using the open-source H2O machine learning framework in conjunction with its big data infrastructure, which gathers more than 20 terabytes of log data every day.

To gain insight from this massive Hadoop dataset, PayPal, which handles over 13 million online monetary transactions per day, combines three families of machine learning models (linear, nonlinear, and deep learning) to help identify and stop fraudsters. The company estimates that in just the first few years of deploying its fraud detection systems, it has saved over $700 million by catching fraudulent transactions that it would otherwise not have noticed.
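PayPal does not publish how its models are combined, so the following is only a minimal sketch of the general idea of blending a linear, a nonlinear, and a neural model into one fraud score. Every feature name (`amount_zscore`, `new_device`), weight, and threshold here is a hypothetical placeholder chosen for illustration, not something taken from PayPal or H2O.

```python
import math


def sigmoid(x: float) -> float:
    """Squash any real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def linear_score(tx: dict) -> float:
    # Linear model: logistic regression with made-up weights.
    z = 0.8 * tx["amount_zscore"] + 1.2 * tx["new_device"] - 1.0
    return sigmoid(z)


def nonlinear_score(tx: dict) -> float:
    # Nonlinear model: a tiny hand-written decision tree.
    if tx["new_device"] and tx["amount_zscore"] > 2.0:
        return 0.9
    if tx["amount_zscore"] > 3.0:
        return 0.6
    return 0.05


def neural_score(tx: dict) -> float:
    # Stand-in for a deep model: one hidden layer with fixed, made-up weights.
    x = [tx["amount_zscore"], float(tx["new_device"])]
    hidden = [
        math.tanh(0.5 * x[0] + 1.0 * x[1]),
        math.tanh(-0.3 * x[0] + 0.7 * x[1]),
    ]
    return sigmoid(1.5 * hidden[0] + 0.5 * hidden[1] - 1.0)


def ensemble_score(tx: dict, weights=(0.3, 0.3, 0.4)) -> float:
    # Weighted average of the three model scores (weights sum to 1).
    scores = (linear_score(tx), nonlinear_score(tx), neural_score(tx))
    return sum(w * s for w, s in zip(weights, scores))


typical = {"amount_zscore": 0.0, "new_device": 0}
suspicious = {"amount_zscore": 3.5, "new_device": 1}

for name, tx in (("typical", typical), ("suspicious", suspicious)):
    score = ensemble_score(tx)
    # The 0.5 review threshold is an assumption for this sketch.
    print(f"{name}: score={score:.2f} review={score > 0.5}")
```

In a real deployment each of the three functions would be a trained model, and the blend itself would typically be learned rather than fixed by hand.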

Though PayPal may have been one of the first to recognise the value of converging HPC and big data technologies, today virtually all the major financial services firms are seeking novel ways to combine these technologies in order to protect themselves. Another high-profile example is MasterCard, which has a staggering 2.2 billion cards in use in more than 210 countries and territories, and handles roughly 160 million transactions per hour, or 52 billion transactions a year. MasterCard, much like PayPal, employs a hybrid machine learning approach that uses supervised and unsupervised learning, in addition to traditional big data technologies like Hadoop and Spark, to examine the location, spending habits, and travel patterns of the customer before each purchase is approved. According to Nick Curcuru, Vice President of Global Big Data Consulting at MasterCard, the company's infrastructure applies 1.9 million distinct rules to each transaction, and processes each transaction in just milliseconds.
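To make the idea of per-transaction rules concrete, here is a minimal sketch of two such checks, one on spending habits and one on location. It is not MasterCard's system: the `Transaction` fields, the rule names, and the three-standard-deviation cut-off are all assumptions made for illustration.

```python
from dataclasses import dataclass
from statistics import mean, pstdev
from typing import Callable, Dict, List


@dataclass
class Transaction:
    amount: float
    country: str                 # where the purchase is being attempted
    home_country: str            # cardholder's home country
    recent_countries: List[str]  # where the card was used recently
    past_amounts: List[float]    # cardholder's recent purchase amounts


def unusual_amount(tx: Transaction) -> bool:
    # Spending-habit check: flag amounts far from the customer's own history.
    if not tx.past_amounts:
        return False  # no history, so nothing to compare against
    mu = mean(tx.past_amounts)
    sigma = pstdev(tx.past_amounts)
    if sigma == 0:
        return tx.amount != mu
    return abs(tx.amount - mu) / sigma > 3.0  # assumed cut-off


def unfamiliar_location(tx: Transaction) -> bool:
    # Location/travel check: flag countries the card has not been seen in.
    return tx.country != tx.home_country and tx.country not in tx.recent_countries


# Hypothetical rule registry; a production system would hold far more rules.
RULES: Dict[str, Callable[[Transaction], bool]] = {
    "unusual_amount": unusual_amount,
    "unfamiliar_location": unfamiliar_location,
}


def triggered_rules(tx: Transaction) -> List[str]:
    """Return the names of all rules that fire for this transaction."""
    return [name for name, rule in RULES.items() if rule(tx)]


history = [20.0, 25.0, 30.0, 22.0, 28.0]
routine = Transaction(24.0, "GB", "GB", ["GB", "FR"], history)
odd = Transaction(900.0, "BR", "GB", ["GB", "FR"], history)

print(triggered_rules(routine))
print(triggered_rules(odd))
```

Rules like these are cheap to evaluate, which is part of why very large rule sets can still be applied within a tight latency budget; the learned models then handle the patterns that fixed rules miss.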

Other major institutions have developed their own mix of technologies to protect themselves against fraud. Citibank, for example, has recently invested in machine learning companies such as Feedzai, Cylance and Ayasdi in order to bolster its fraud detection capability, and the company recently announced that it would open a branch of its Global Innovation Lab at a London WeWork, specialising in the development of big data and high-performance computing technologies.

The quest for better fraud protection is a never-ending one, however, as credit card fraud continues to increase in severity. Credit card fraud in Europe caused €1.8 billion in damages in 2016 (with the UK and France accounting for 73% of that), while in the US credit card fraud has been a persistent problem, and even debit cards, which have so far been relatively safe, are starting to see a rise in fraud.

The problems of controlling fraud are compounded by demands from consumers, who want faster, easier payments and greater flexibility. This pressure was almost certainly a factor when the Payment Card Industry (PCI) Security Standards Council, the standards body that oversees the security of credit and debit card transactions, decided to start allowing PINs to be entered on mobile phones. The decision, which may benefit the user experience, also introduces a new level of vulnerability into the transaction process, and has therefore elicited some scepticism.

Delivering an improved user experience while also maintaining maximum security will necessitate improved fraud detection systems. This points to a need for stronger, more deeply converged systems that can gather deeper insight from customer and transaction data. Realising a better union of HPC, big data, and machine learning will require greater harmony between historically separate computational infrastructures, and perhaps also a shift in perception. Challenges aside, major players are already making serious commitments to this convergence, and envision a not-too-distant future where the three workloads can be seamlessly processed on one system.

Verne Global, which provides its clients in these fields with low-cost, fully sustainable power resources, is in an ideal position to help facilitate this transformation, a role we're eager to fulfil.

Written by Spencer Lamb


Spencer is Verne Global's Director of Research and heads up our high-performance computing work with European research and scientific organisations. He is also a member of the European Technology Platform for High Performance Computing (ETP4HPC).
