
Guide to: Running a Smart Text Collection Job

Overview

Smart Text includes several features that help ensure high-quality, customized, and original data: disabled copy/paste, minimum and maximum word counts, spelling and grammar checks, and now Rich Text. Together, these smooth the contributors' writing experience and improve output quality, especially for LLM-related jobs such as creative writing prompt/response pairs and response improvement.

In addition to the validators outlined below, Smart Text is also compatible with the Smart Validators described in this article.

Job design

From the sidebar, choose "Smart Text".

Disable Pasting

Once you have chosen Smart Text, you will see a "Disable Pasting" checkbox. When pasting is disabled, contributors cannot paste content into the input text box, regardless of where it comes from (another judgment, a document on their desktop, their browser, and so on). Copy/paste is disabled via right-click, hotkeys, and keyboard shortcuts.

Enforce Word Count

To help contributors calibrate the length of their input, you can enforce a minimum and maximum word count (also available for cml:text and cml:textarea). Expand "Basic Validation" and choose "Minimum Word Count" and/or "Maximum Word Count".

If contributors submit fewer words than the minimum, or try to submit more words than the maximum, they receive an error message.
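
The word count rule can be sketched in a few lines of Python. This is only an illustration: the article does not say how the platform counts words, so the whitespace-based split below is an assumption, and the function names and error strings are hypothetical rather than the messages contributors actually see.

```python
from typing import Optional


def count_words(text: str) -> int:
    """Count words as whitespace-separated tokens (an assumption, not the platform's documented rule)."""
    return len(text.split())


def check_word_count(
    text: str,
    minimum: Optional[int] = None,
    maximum: Optional[int] = None,
) -> Optional[str]:
    """Return a hypothetical error string if a limit is violated, otherwise None."""
    words = count_words(text)
    if minimum is not None and words < minimum:
        return f"Too short: {words} words, minimum is {minimum}."
    if maximum is not None and words > maximum:
        return f"Too long: {words} words, maximum is {maximum}."
    return None
```

Either limit can be left unset, which mirrors the "and/or" choice in the Basic Validation settings.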

Spelling and Grammar Check

When using Smart Text, you can enable a grammar and spelling check for the input text. The check catches issues with spelling, punctuation, agreement, and conjugation.

To add these checks to your job:

  • Click "Save & Close" in the Smart Text setup modal to return to your design page.
  • A new box named "Add custom data validation to your job" appears in the upper right-hand corner of the design page. Click "Manage custom data validation" to enable grammar and spelling rules for your target language.
  • On the next page, click "Add rules".
  • Choose "Language check", then select the language and the locale you want to target. Click "Save" to apply the grammar and spelling validation to the job.

Once the check is enabled:

  • While contributors type, spelling and grammar issues are underlined in red. Clicking an underlined word displays a suggested fix.
  • To accept a suggestion, contributors click the corrected form of the word. If they disagree with the suggestions, they can click the trash icon to reject them.

Smart Text lets you require contributors to either accept each suggestion or actively ignore it, which keeps spelling and grammar correction consistent throughout the job. When you use the cml attribute enforced="true" along with Language Checks, contributors cannot submit their judgments until they have corrected or ignored every suggestion.
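
The enforcement rule amounts to a simple gate on submission, sketched below in Python. The `Suggestion` class, the `Status` values, and `can_submit` are hypothetical names made up for this illustration; they are not part of CML or of any platform API, and the real check runs inside the contributor's editor.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class Status(Enum):
    PENDING = "pending"    # underlined, contributor has not acted yet
    ACCEPTED = "accepted"  # contributor clicked the corrected form
    IGNORED = "ignored"    # contributor clicked the trash icon


@dataclass
class Suggestion:
    original: str
    replacement: str
    status: Status = Status.PENDING


def can_submit(suggestions: List[Suggestion], enforced: bool) -> bool:
    """With enforcement on, every suggestion must be accepted or ignored."""
    if not enforced:
        return True
    return all(s.status is not Status.PENDING for s in suggestions)
```

With enforcement off, pending suggestions never block submission; with it on, a single untouched suggestion is enough to block it.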

Currently Supported Languages

  • English, in the following locales:
    • American English
    • British English
    • Australian English
    • Canadian English

Rich Text Editor

You can now design jobs using a Rich Text Editor (RTE). The RTE lets contributors format their input text with the following:

  • Tables
  • Code blocks with syntax highlighting for HTML, SQL, Java, JavaScript, and more
  • Bold text
  • Underlined text
  • Italicized text
  • Bullet lists, which can be used to represent two different speakers, as in a chatbot conversation

When using Smart Text, Rich Text is enabled by default. To disable it, untick the Rich Text checkbox in the graphical editor, or set the cml attribute rich="false".

Rich Text Output Format

{
  ableToAnnotate: <boolean>,
  annotations: {
    text: "...",
    rawContent: "...",
    contentType: "html"
  },
  metadata: { ... }
}

In the results report, you can also view the raw text without HTML markup for readability. In Quality Flow, any input text formatted with the Rich Text Editor is displayed in subsequent jobs as formatted by the initial contributor. The reviewer can modify the formatting as needed to improve the output quality.
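
If you post-process Rich Text output yourself, a markup-free view can be approximated with Python's standard library. The sketch below rests on several assumptions: that the stored value is valid JSON with quoted keys, that rawContent is the field carrying the HTML, and that joining text nodes with single spaces is acceptable. The sample record and the `plain_text` helper are hypothetical.

```python
import json
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect text nodes and discard all tags."""

    def __init__(self) -> None:
        super().__init__()
        self.parts = []

    def handle_data(self, data: str) -> None:
        self.parts.append(data)


def plain_text(annotations: dict) -> str:
    """Return rawContent without markup when contentType is "html"."""
    raw = annotations.get("rawContent", "")
    if annotations.get("contentType") != "html":
        return raw
    parser = _TextExtractor()
    parser.feed(raw)
    parser.close()
    # Join text nodes with spaces, then collapse runs of whitespace.
    return " ".join(" ".join(parser.parts).split())


# Hypothetical record shaped like the output format above.
record = json.loads(
    '{"ableToAnnotate": true,'
    ' "annotations": {"text": "...",'
    ' "rawContent": "<p>Hello <b>world</b></p>",'
    ' "contentType": "html"},'
    ' "metadata": {}}'
)
print(plain_text(record["annotations"]))  # prints: Hello world
```

Joining with spaces keeps list items and table cells from running together, at the cost of splitting a word that is only partly formatted (for example, one with a single bold syllable).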

 

Job Report

Refer to this article for information on Annotation Tools Job Reports.

