Machine Learning networks outperform others....
Perhaps... but how relevant is it to your company and risks?
To those of you who assumed I am referring to a neural network or equivalent in deep learning. I am not.
You are right, however I have deliberately ignored this, here's why.
Neural networks are mostly opaque and not explainable. Their predictions cannot be interpreted, typically the features cannot be understood by the business, building and tuning is likely an architectural challenge that has low ROI for fraud prevention. To summarize it is overkill.
When I refer to ML networks, I mean entity link analysis. These have typically evolved into a graph database out of law enforcement / insurance from a classification based model.
For example how is this person, device, IP address, cookies, user agent, location address, social media footprint linked to another person’s, device, IP address...etc.
Vendors that utilize networks and apply ML against a network of data have a competitive advantage to others
The network built by the vendor may indicate fraud, or more realistically may be used in a Boolean way to identify interactions with historically proven fraud.
However, you need to ask yourself how relevant the data is to your product against your risks?
How relevant is:
Aside from considering the capability of the solution, fit to your product and risks you must also consider:
Ultimately, how relevant is the fraud network to your business?
An example of where a great fraud network may miss the mark in eBanking and authentication is Amazon
Amazon are diversifying from an ecommerce space into eBanking due to AWS and have brought across some useful tools such as Amazon Fraud Detector.
Their CNP network is arguably one of the best in the world given their capability ship 1.6m packages a day confidently. Their refund process is likely the envy of many banks as is their ease to enable a secure payment transfer.
This however doesn’t transfer well as part of an Digital Banking Authentication and Fraud Detection tool.
Why?
The AWS model is strong at solving a different problem in a Card Not Present (CNP) space specifically detecting checkout/chargeback fraud. Important data points to the model:
In an eBanking space you are more interested in the interaction of the user. The interaction of a user comprises of
Meaning the model is more interested in velocity-based features which can be attributed to user behaviour.
Typical data attributes are:
When you contrast this to CNP, then an IP address is of use, but we know IPv4 IP Addresses are recycled (I have some funny stories around that, but that's for another time).
The device can also be used but is it the same device used to bank as purchase on Amazon?
Email address is less relevant as are many of the features that make the CNP fraud prevention model performant. A CNP Model is typically evaluating checkout and chargeback risk.
Recommended by LinkedIn
Most fraud networks started in CNP and transferred to eBanking, threat metrix for example.
In eBanking authentication you are more concerned about frequencies and knowing your users. For example:
Anti Virus (AV) and other Cyber Threat Intelligence (CTI) platforms may also be irrelevant.
Typically an AV will defend the enterprise, staff and technology from different types of cyber attacks such as:
Which sounds great, however it is likely to miss the mark.
Whilst it is true that they have class leading research and data on different attacks based with forensic understanding. It is also true that the vast majority of the data will be of low value to defend a digital banking product when used as a threat intelligence network as they typically do not look at the user and their spend / behaviour.
Attributes such as an IP address and device ID may not be useful.
For example they likely relate to the server initiating a spray and pray phishing attack or a command and control for the botnet, not the botnet devices themselves used in the attack against an eBanking product.
This is a subtle but important difference.
Thus the data will not relate to the device used in the attack on an ebanking user, and even if it did it is simple to recycle both of these attributes.
Where an AV is more powerful is when it is deployed clientside i.e. to protect a mobile device or laptop.
However how many users do you know that download an AV recommended by the bank, privacy is a big concern here as trusteer found out.
So do Fraud Prevention ML networks outperform others?
It depends on how relevant they are to your product and risks.
5 steps to evaluate an ML Fraud Network:
I hope this seventh article was of use to you and this concludes the common misconceptions series.
I will recap these articles next week for those who want a closing fast overview.
Feedback is very welcome, please do let me know if there are any problems and I will look to update the post. Or if you think there is something I’ve missed and should cover, then reach out.
I love to learn and hearing from you all!
Thank you for your time.
Links referred to in the article:
Links
Fraud and Cyber Security Consultant specializes in Digital Banking, Fraud Prevention, Privacy, Identity, Authentication, and Transformation
2yPhilippe Saint-Amand Ben Balthazar Eduardo Hennemann de Oliveira Roman Bonbinkov Sharon L. Andreas Gutmann (PhD) Ismini Psychoula, PhD Andrew Abbotsford-Smith Daniel Mcloughlin Alex Kuznetsov Demetra Demetriou Hot off the press the next and final article in the common misconceptions series. Looking forward to your feedback :-)
Fraud and Cyber Security Consultant specializes in Digital Banking, Fraud Prevention, Privacy, Identity, Authentication, and Transformation
2yCarl Nacouzi I would love your thoughts on this article. Perhaps contentious, lets see :-)