SAS Visual Text Analytics

June 30, 2022

SAS Visual Text Analytics.The purpose of this assignment is to use SAS Visual Text Analytics to analyse a dataset labelled AmazonAlexaReviews available on Moodle as a CSV file.

The purpose of this assignment is to use SAS Visual Text Analytics to analyse a dataset labelled AmazonAlexaReviews available on Moodle as a CSV file.

The dataset consists of 3151 Amazon verified customer reviews of various amazon Alexa products like Alexa Echo, Echo dots, Alexa Firesticks etc. The data also includes star ratings, date of review, and variants of Alexa products. In this assignment, you will use only the customer reviews text, which is named verified_reviews. The verified_reviews variable represents free-form, unstructured customer reviews collected from Amazon’s website.

You are required to conduct a data analysis of Amazon verified customer reviews using SAS Visual Text Analytics in two parts. Part 1 consists of exploring predefined concepts and automatically generated topics to derive insights from the data. Part 2 consists of defining your own custom concepts and custom categories to answer specific research questions.

SAS Visual Text Analytics on G Cloud

SAS Visual Text Analytics provides a comprehensive solution that overcomes the challenges of identifying and categorising text data and offers a wide variety of modeling approaches, including supervised and unsupervised machine learning, linguistic rules, categorisation, entity extraction, sentiment analysis and topic detection.

Features

Self-service discovery. Web-based exploratory analysis
Self-service data preparation. Import data, join tables, manage data
Self-service analytics. Generate and use analytical models
Model development with machine learning algorithms, eg decision forests
Accesses, integrates, profiles, cleanses and transforms data
Text analysis, to gain sentiment and insight from text data
Separates text into words, phrases, punctuation and other elements
Uses unsupervised machine learning to group documents on common themes
Pulls out specific pieces of information or relationships from text
Combines natural language capabilities to build effective text models

Benefits

Dramatically shorten model development time, put models into action sooner
Improve the productivity by reducing manual experimentation and improving collaboration
Visually explore all relevant data smartly, quickly and easily
Create interactive reports by querying data from multiple sources
Import data from a variety of sources: Database, Hadoop, Social
Add geographical context to analyses and visualisations
Integrate with open source development tools and workbenches
Enables data manipulation and data quality operations on data sources
Ability to create mobile apps using the SAS SDK
Collaborative environment to support analytics development across teams Edit

£518 a user a month

Free trial available

Service documents

Request an accessible format

Framework

G-Cloud 12

Service scope

Software add-on or extensionNo

Cloud deployment modelPublic cloud

Service constraintsFor system maintenance SAS carries out third weekend maintenance. SAS will initiate this process and provide customers with advance notice of any planned maintenance.

System requirements

Client computers that run SAS interfaces require modern operating systems
SAS recommends 64-bit web browsers run on 64-bit operating systems
SAS supports 32-bit web browsers run on 32-bit operating systems
SAS requires Google Chrome 61.0 and later
SAS requires Mozilla Firefox 52.0 and later
SAS requires Microsoft Edge 40.1 and later
SAS requires Apple Safari 10.0 and later

Overview

SAS Visual Text Analytics in SAS Viya is a web-based text analytics application that uses context to provide a comprehensive solution to the challenge of identifying and categorizing key textual data. In SAS Visual Text Analytics, you can use the following analysis nodes to build and automate models (based on training documents):

Concepts
Text Parsing
Topics
Categories

You can then customize your models in order to realize the value of your text-based data.

Note: Internet Explorer 11 is not supported for SAS Visual Text Analytics 8.5.

SAS Visual Text Analytics in SAS Viya combines the visual programming flow of SAS Text Miner with the rules-based linguistic methods of categorization and concept extraction in SAS Contextual Analysis. These capabilities, along with document-level scoring for each component, are combined in a single user interface.

Using SAS Visual Text Analytics in SAS Viya, you can identify key textual data in your document collections, build concept and categorization models, and remove meaningless textual data.

By default, words that provide little or no informational value (stop words) are excluded from topic analysis. A default stop list is included and automatically applied for all supported languages. Examples of stop words in English include the articles a, an, and the and conjunctions such as and, or, and but. Other terms that are specific to your document collection but provide little or no value due to their low frequency are also identified and excluded. For more information about stop lists, see Text Mining Action Set: Details in SAS Visual Text Analytics 8.5: Programming Guide.

Visual Text Analytics Basics

SAS Visual Text Analytics provides a number of text analysis nodes that are arranged in a sequence that you control. This sequence takes the form of a pipeline, which empowers you to analyze your document collection with considerable flexibility. When you run a pipeline, the following analyses are performed on data in your project:

The Concepts analysis node in SAS Visual Text Analytics enables you to extract predefined concepts or create additional custom concepts that you can discover in a document or set of documents. For more information about concepts, see Concepts.
The Text Parsing analysis node finds all the terms that are in your document collection. The Text Parsing node uses the default stop list provided for the selected project language to determine which terms are excluded from further analysis. In addition, the Text Parsing node displays useful groups of words such as nouns with their modifiers that can be used for topic discovery. For more information about text parsing, see Terms and Synonyms. For more information about stop lists, see Start Lists and Stop Lists.
The Topics analysis node groups similar documents in a collection into related themes, or topics. The documents in each topic often contain similar subject matter, such as motorcycle accidents, computer graphics, or weather patterns. Automatic topic identification enables you to easily categorize each document in your collection. For more information about topics, see Topics.
The Sentiment analysis node determines whether documents express positive, neutral, or negative attitudes. Analysis performed after the Sentiment Analysis node displays a sentiment indicator for each document. For more information about sentiment scoring, see Sentiment Scoring.
The Categories analysis node labels documents based on their content. You can create categories using these methods:
- Specify category (target) variables in your training documents
- Create new categories that correspond to your organization’s interests
- Add discovered topics as categories
For more information about categories, see Categories.

The models that are generated for Concepts, Sentiment, Topics, and Categories can then be deployed, and used to automate the process of labeling input documents. You can also register your models, which allows for model governance and model change control over time. For more information about registering models, see Registering Models.

Attachments

Click Here To Download