Anal Product Vector

GitHub Repo https://github.com/Rajsh1111/Creating-Product-recommender-using-VectorDB-and-Market-Basket-Analysis

Rajsh1111/Creating-Product-recommender-using-VectorDB-and-Market-Basket-Analysis

No repository description available.

GitHub Repo https://github.com/SEZ9/ShopAnalytica

SEZ9/ShopAnalytica

Using DynamoDB zero-ETL for near real-time vectorization of business data, transferring it to OpenSearch. Combining Bedrock embedded model and LLM model for product recommendations and sentiment analysis on customer reviews.

GitHub Repo https://github.com/sahidesu25/Sentiment-Analysis-on-Amazon-Product-Reviews

sahidesu25/Sentiment-Analysis-on-Amazon-Product-Reviews

With the explosion of social networking sites, blogs and review sites a lot of information is available on the web. This information contains emotions and opinions about various product features and the makers of these products. This form of opinion and feedback is important to the companies developing these products as well as the companies that want to develop better rival products. Sentiment Analysis is the task of analyzing all this data, retrieving opinions about these products and services and classifying them as positive or negative, in other words good or bad. The key parts of any review of any product are the numeric rating and the textual description provided along with this product. In our project we will take into consideration both these vectors for product reviews to conclusively decide on a classifier that is best suited to analysis of product reviews. We have gathered reviews and based on the features that best describe the sentiment for each review, we have created a feature set of 1000 features, and with this limited set we will determine which classifier gives the best result on review type data. To determine the best classifier we perform evaluations on it, by running various data set generators, calculating the resubstitution and generalization errors for each classifier. We then use the mean of these results to compute the paired Student’s t-test to relatively compare the performance of the classifiers. Based on the results of this evaluation, we can state which is the best classifier.

GitHub Repo https://github.com/shahed-dev01/house-feature-vector-analysis

shahed-dev01/house-feature-vector-analysis

Demonstrates foundational linear algebra concepts (vectors, dot products, normalization) and their direct application to data preprocessing and feature analysis in Machine Learning.

GitHub Repo https://github.com/jatinwarade/Sentiment-analysis-using-SVM

jatinwarade/Sentiment-analysis-using-SVM

In this application Support vector machine (SVM) is used to classify movie/product reviews into positive or negative.

GitHub Repo https://github.com/Jai-Agarwal-04/Sentiment_Analysis_with_Insights

Jai-Agarwal-04/Sentiment_Analysis_with_Insights

Sentiment Analysis with Insights using NLP and Dash This project show the sentiment analysis of text data using NLP and Dash. I used Amazon reviews dataset to train the model and further scrap the reviews from Etsy.com in order to test my model. Prerequisites: Python3 Amazon Dataset (3.6GB) Anaconda How this project was made? This project has been built using Python3 to help predict the sentiments with the help of Machine Learning and an interactive dashboard to test reviews. To start, I downloaded the dataset and extracted the JSON file. Next, I took out a portion of 7,92,000 reviews equally distributed into chunks of 24000 reviews using pandas. The chunks were then combined into a single CSV file called balanced_reviews.csv. This balanced_reviews.csv served as the base for training my model which was filtered on the basis of review greater than 3 and less than 3. Further, this filtered data was vectorized using TF_IDF vectorizer. After training the model to a 90% accuracy, the reviews were scrapped from Etsy.com in order to test our model. Finally, I built a dashboard in which we can check the sentiments based on input given by the user or can check the sentiments of reviews scrapped from the website. What is CountVectorizer? CountVectorizer is a great tool provided by the scikit-learn library in Python. It is used to transform a given text into a vector on the basis of the frequency (count) of each word that occurs in the entire text. This is helpful when we have multiple such texts, and we wish to convert each word in each text into vectors (for using in further text analysis). CountVectorizer creates a matrix in which each unique word is represented by a column of the matrix, and each text sample from the document is a row in the matrix. The value of each cell is nothing but the count of the word in that particular text sample. What is TF-IDF Vectorizer? TF-IDF stands for Term Frequency - Inverse Document Frequency and is a statistic that aims to better define how important a word is for a document, while also taking into account the relation to other documents from the same corpus. This is performed by looking at how many times a word appears into a document while also paying attention to how many times the same word appears in other documents in the corpus. The rationale behind this is the following: a word that frequently appears in a document has more relevancy for that document, meaning that there is higher probability that the document is about or in relation to that specific word a word that frequently appears in more documents may prevent us from finding the right document in a collection; the word is relevant either for all documents or for none. Either way, it will not help us filter out a single document or a small subset of documents from the whole set. So then TF-IDF is a score which is applied to every word in every document in our dataset. And for every word, the TF-IDF value increases with every appearance of the word in a document, but is gradually decreased with every appearance in other documents. What is Plotly Dash? Dash is a productive Python framework for building web analytic applications. Written on top of Flask, Plotly.js, and React.js, Dash is ideal for building data visualization apps with highly custom user interfaces in pure Python. It's particularly suited for anyone who works with data in Python. Dash apps are rendered in the web browser. You can deploy your apps to servers and then share them through URLs. Since Dash apps are viewed in the web browser, Dash is inherently cross-platform and mobile ready. Dash is an open source library, released under the permissive MIT license. Plotly develops Dash and offers a platform for managing Dash apps in an enterprise environment. What is Web Scrapping? Web scraping is a term used to describe the use of a program or algorithm to extract and process large amounts of data from the web. Running the project Step 1: Download the dataset and extract the JSON data in your project folder. Make a folder filtered_chunks and run the data_extraction.py file. This will extract data from the JSON file into equal sized chunks and then combine them into a single CSV file called balanced_reviews.csv. Step 2: Run the data_cleaning_preprocessing_and_vectorizing.py file. This will clean and filter out the data. Next the filtered data will be fed to the TF-IDF Vectorizer and then the model will be pickled in a trained_model.pkl file and the Vocabulary of the trained model will be stored as vocab.pkl. Keep these two files in a folder named model_files. Step 3: Now run the etsy_review_scrapper.py file. Adjust the range of pages and product to be scrapped as it might take a long long time to process. A small sized data is sufficient to check the accuracy of our model. The scrapped data will be stored in csv as well as db file. Step 4: Finally, run the app.py file that will start up the Dash server and we can check the working of our model either by typing or either by selecting the preloaded scrapped reviews.

GitHub Repo https://github.com/sherlvick/sentimental-analysis_SVM

sherlvick/sentimental-analysis_SVM

Sentiment analyis of Amazon product reviews using SVM 'rbf':kernel classifier in which word vectorization is done using TF_IDF and CountVectorizer.

GitHub Repo https://github.com/vishwassathish/Sentiment-Analysis-for-product-reviews

vishwassathish/Sentiment-Analysis-for-product-reviews

Sentiment Analysis using LSTM cells on Recurrent Networks. GloVe word embeddings were used for vector representation of words. Amazon Product Reviews were used as Dataset.

GitHub Repo https://github.com/picotech/picosdk-matlab-picovna-vector-network-analyzer-toolbox

picotech/picosdk-matlab-picovna-vector-network-analyzer-toolbox

A Toolbox for use with PicoVNA® Vector Network Analyzer products in MATLAB.

GitHub Repo https://github.com/Thaneshwar-sahu/Price-Feature_Vector_Analysis_for_Beverage_Mug_Product_Optimization

Thaneshwar-sahu/Price-Feature_Vector_Analysis_for_Beverage_Mug_Product_Optimization

This project aims to optimize the price-feature vector for a beverage mug line, identifying the best combination of price and features to enhance market competitiveness. The goal is to determine which features and price points attract consumers, helping to create a product that balances consumer preferences with market trends.