Fake Profile Identification using Machine learning

This project is about to create a framework, by this we can detect a fake profiles using ML algorithms, makes people social life more secure. The model presented in this project demonstrates that Support Vector Machine (SVM) is an elegant and robust method for binary classification in a large dataset. Regardless of the non-linearity of the decision boundary, SVM is able to classify between fake and genuine profiles with a reasonable degree of accuracy (>90%)

Platform? ? ? ? ?: Python
Delivery? ? ? ? ? :? One Day
Support? ? ? ? ? : Online Demo with Explanation
Deliverables? : Project Files, Report and Presentation

Description

Fake Profile Identification using Machine learning

Social network, a crucial part of our life is plagued by online impersonation and fake accounts. According to the ?Community Standards Enforcement Report? published by Facebook on March 2018, about 583 million fake accounts were taken down just in quarter 1 of 2018 and as many as 3-4% of its active accounts during this time were still fake. In this project, we propose a model that could be used to classify an account as fake or genuine. This model uses Support Vector Machine as a classification technique and can process a large dataset of accounts at once, eliminating the need to evaluate each account manually. The community of concern to us here is Fake Accounts and our problem can be said to be a classification or a clustering problem. Fake Profile Identification using Machine learning


Fake Profile Identification using Machine learning

Introduction:

In the present generation, the social life of everyone has become associated with online social networks. Adding new friends and keeping in contact with them and their updates has become easier. Online social networks have an impact on science, education, grassroots organizing, employment, business, etc. Researchers have been studying these online social networks to see the impact they make on people. Teachers can teach the students easily through this making a friendly environment for the students to study, teachers nowadays are getting themselves familiar with these sites by bringing online classroom pages, giving homework, making discussions, etc. which improves education a lot. Employers can use these social networking sites to employ people who are talented and interested in the work, their background checks can be done easily.

 


Existing System:

  • The naive Bayes algorithm has less accuracy.

Disadvantages:

  • Because of Privacy Issues, the Facebook dataset is very limited and a lot of details are not made public.

Proposed System:

The Application Domain of the following project was Community Detection. Community detection is key to understanding the structure of complex networks, and ultimately extracting useful information from them.

In this project, we came up with a framework through which we can detect a fake profile using machine learning algorithms so that the social life of people becomes secure.

  1. Classification starts from the selection of profile that needs to be classified.
  2. Once the profile is selected, the useful features are extracted for the purpose of classification.
  3. The extracted features are then fed to the trained classifier.
  4. The classifier is trained regularly as new data is fed into the classifier.
  5. The classifier then determines whether the profile is genuine or fake.
  6. The result of the classification algorithm is then verified and feedback is fed back into the classifier.
  7. As the number of training data increases the classifier becomes more and more accurate in predicting the fake profiles.

Advantages:

  • Social networking sites are making our social lives better but nevertheless, there are a lot of issues with using these social networking sites.
  • The issues are privacy, online bullying, the potential for misuse, trolling, etc. These are done mostly by using fake profiles. In
  • In this project, we came up with a framework through which we can detect a fake profile using machine learning algorithms so that the social life of people becomes secured.

Software and Hardware Requirements:

Hardware:

  • OS? Windows 7, 8 and 10 (32 and 64 bit)
  • RAM? 4GB

Software:

  • Python
  • Anaconda

 


CONCLUSIONS AND FUTURE WORK

The model presented in this project demonstrates that Support Vector Machine (SVM) is an elegant and robust method for binary classification in a large dataset. Regardless of the non-linearity of the decision boundary, SVM is able to classify between fake and genuine profiles with a reasonable degree of accuracy (>90%).? This method can be extended on any platform that needs binary classification to be deployed on public profiles for various purposes. This project uses only publicly available information which makes it convenient for organizations that want to avoid any breach of privacy, but organizations can also use private data available to them to further extend the capabilities of the proposed model.


Future Work

Since we have limited data to train the classifier, our approach is facing a high variance problem which can be observed in the learning curve as follows High variance problems can usually be mitigated by increasing the size of the dataset which should not be of much concern to Social Networks Organizations which already have

Customer Reviews

There are no reviews yet.

Be the first to review “Fake Profile Identification using Machine learning”

This site uses Akismet to reduce spam. Learn how your comment data is processed.