Similarity

Overview

The Text Similarity function plug-in allows users to compute a similarity score between pairs of texts. By computing a similarity score, one can find related items, group similar items, detect duplicates, and more.


Key Features & Functionality

Computes a similarity score between pairs of texts, returning a value between 0.0 and 1.0, where a higher score implies a higher similarity.

Anonymous
Parents
  • I had a look at the source code. It says

    // This plug-in is resource intensive. Through performance tests, we found out that sending
    // over 300 sentences, can potentially crash a dev site.

    // The model is large (~330MB) and thus consumes a decent amount of memory.

    It uses an Transformer AI model to calculate the cosine similarity between two vectors based on: stackoverflow.com/.../246508

Comment
  • I had a look at the source code. It says

    // This plug-in is resource intensive. Through performance tests, we found out that sending
    // over 300 sentences, can potentially crash a dev site.

    // The model is large (~330MB) and thus consumes a decent amount of memory.

    It uses an Transformer AI model to calculate the cosine similarity between two vectors based on: stackoverflow.com/.../246508

Children
  • Hi Stefan,

    Thank you for taking a look! As you point out, it's CPU-intensive, but the plug-in will take up to 300 strings to prevent any problems. I'll provide documentation shortly pointing out its usage and warnings.

    Please let me know if you have any comments or feedback,

    Thank you!