Gandhi Surya Buana
Hello There, I'm Gandhi!
Personal Project
Data Integrations
Helps you integrate and match records across various databases and data sources.
Example!
Research Background
Search engines have made it easier to conduct literature reviews. However, for some niche topics, search results are often poor.
Snowballing and recommendations from paper repositories may help, but have limitations.
A general tool to search for similar articles without relying on a specific database would be very helpful.
Before that, the optimal method for matching similar articles needs to be determined.
Methods Evaluated
Four deep learning-based methods were evaluated:
Smooth Inverse Frequency (SIF)
Recurrent Neural Network (RNN)
Attention
Hybrid
Each method learns a representation that is fed to the actual classifier module, which is a two-layer HighwayNet.
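To make the pipeline above more concrete, here is a minimal NumPy sketch of one possible setup: SIF sentence embeddings followed by a two-layer highway classifier. It is an illustration under stated assumptions, not the project's actual code. The function names, the smoothing constant a, the pair_features combination (absolute difference and element-wise product), the ReLU transform, and the sigmoid output layer are all hypothetical choices that the summary does not specify.

```python
import numpy as np


def sif_embeddings(sentences, word_vectors, word_prob, a=1e-3):
    """Smooth Inverse Frequency embeddings for a list of token lists.

    word_vectors maps token -> vector, word_prob maps token -> unigram
    probability, and `a` is the smoothing constant (assumed value).
    """
    dim = len(next(iter(word_vectors.values())))
    emb = np.zeros((len(sentences), dim))
    for i, tokens in enumerate(sentences):
        known = [t for t in tokens if t in word_vectors]
        if not known:
            continue  # keep a zero vector when no token is in the vocabulary
        weights = np.array([a / (a + word_prob.get(t, 0.0)) for t in known])
        vectors = np.array([word_vectors[t] for t in known])
        emb[i] = weights @ vectors / len(known)  # frequency-weighted average
    # Remove the projection onto the first right singular vector,
    # i.e. the component shared by all sentence embeddings.
    _, _, vt = np.linalg.svd(emb, full_matrices=False)
    u = vt[0]
    return emb - np.outer(emb @ u, u)


def pair_features(u, v):
    """Hypothetical way to combine two article representations."""
    return np.concatenate([np.abs(u - v), u * v], axis=-1)


def highway_layer(x, w_h, b_h, w_t, b_t):
    """One highway layer: y = T(x) * H(x) + (1 - T(x)) * x."""
    h = np.maximum(0.0, x @ w_h + b_h)          # candidate transform H(x)
    t = 1.0 / (1.0 + np.exp(-(x @ w_t + b_t)))  # transform gate T(x)
    return t * h + (1.0 - t) * x


def classify_pair(x, highway_params, w_out, b_out):
    """Apply the highway layers, then a sigmoid output (assumed head)."""
    for w_h, b_h, w_t, b_t in highway_params:   # two tuples for two layers
        x = highway_layer(x, w_h, b_h, w_t, b_t)
    logit = x @ w_out + b_out
    return 1.0 / (1.0 + np.exp(-logit))         # probability that the pair matches
```

Because a highway layer mixes its input with the transformed input, the input and output dimensions must match, so in this sketch each weight matrix is square with the size of the pair feature vector.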
Results
The attention method using only the article title as a feature was found to have the best result.
Execution time:
SIF is significantly faster than the others, while Hybrid is significantly slower.
The Attention method places third but is only slightly slower than RNN.
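The summary does not say how execution time was measured. As a rough sketch, a comparison like the one above could be produced with a small timing harness such as the following, where time_method, rank_by_time, and the methods dictionary are hypothetical names introduced only for illustration.

```python
import time


def time_method(fn, inputs, repeats=5):
    """Return the best wall-clock time in seconds over several runs of fn(inputs)."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn(inputs)
        best = min(best, time.perf_counter() - start)
    return best


def rank_by_time(methods, inputs, repeats=5):
    """Time every method on the same inputs and sort from fastest to slowest.

    methods maps a method name to a callable that processes the inputs.
    """
    timings = {name: time_method(fn, inputs, repeats) for name, fn in methods.items()}
    return sorted(timings.items(), key=lambda item: item[1])
```

Taking the best of several runs reduces noise from other processes, although it is only one of several reasonable choices; a mean or median over runs would also work.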
Feature selection:
The title field was found to be the best feature for predicting similar pairs.
The authors feature performed poorly on its own but improved results when combined with the title.
The keywords feature was found to be almost as good as the title, but combining the two did not give a significant improvement.
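A feature comparison like the one above can be organised as an ablation over field subsets. The sketch below is only an assumption about how such an experiment might be structured: the field names, the " [SEP] " separator, and the caller-supplied evaluate function are hypothetical and do not come from the project.

```python
from itertools import combinations

FIELDS = ("title", "authors", "keywords")  # assumed field names


def feature_subsets(fields=FIELDS):
    """Yield every non-empty combination of fields, smallest first."""
    for size in range(1, len(fields) + 1):
        yield from combinations(fields, size)


def build_input(record, subset, sep=" [SEP] "):
    """Join the selected fields of one article record into a single string."""
    return sep.join(str(record.get(field, "")) for field in subset)


def run_ablation(pairs, labels, evaluate, fields=FIELDS):
    """Score each field subset with a caller-supplied evaluate(inputs, labels).

    pairs is a list of (record_a, record_b) dictionaries; evaluate is a
    placeholder for whatever training and scoring routine is used.
    """
    results = {}
    for subset in feature_subsets(fields):
        inputs = [(build_input(a, subset), build_input(b, subset)) for a, b in pairs]
        results[subset] = evaluate(inputs, labels)
    return results
```

With three fields this produces seven configurations, which covers the single-field runs as well as combinations such as title plus authors and title plus keywords.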