This repository has been archived by the owner on Mar 22, 2024. It is now read-only.

leksikov/archive_word2vec_kr_java


Java implementation of Word2Vec based on Korean text

This project crawls its dataset from a Korean web service that hosts movie script descriptions and reviews:

https://movie.naver.com

This dataset was selected because movies span a variety of genres, which describe different types of situations. Given the dataset, word embeddings are built using the word2vec language model.
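To illustrate what a word2vec model trains on, here is a minimal sketch of how (center, context) training pairs could be generated from a tokenized sentence. It assumes the skip-gram variant of word2vec; the README does not say whether this project uses skip-gram or CBOW. The class name `SkipGramPairsSketch`, the method `pairs`, the window size, and the sample tokens are all hypothetical and not taken from the repository.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration, not code from this repository.
final class SkipGramPairsSketch {

    // Collects a (center, context) pair for every token that lies
    // within `window` positions of each center token.
    static List<String[]> pairs(List<String> tokens, int window) {
        List<String[]> result = new ArrayList<>();
        for (int center = 0; center < tokens.size(); center++) {
            int from = Math.max(0, center - window);
            int to = Math.min(tokens.size() - 1, center + window);
            for (int context = from; context <= to; context++) {
                if (context != center) {
                    result.add(new String[] {tokens.get(center), tokens.get(context)});
                }
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Assumed sample tokens ("movie", "really", "is fun"); real input
        // would come from the crawled and preprocessed reviews.
        List<String> tokens = Arrays.asList("영화", "정말", "재미있다");
        for (String[] pair : pairs(tokens, 1)) {
            System.out.println(pair[0] + " -> " + pair[1]);
        }
    }
}
```

With a window of 1, each token is paired only with its immediate neighbours, so a three-token sentence yields four pairs. A larger window captures broader context at the cost of more training pairs.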

Programming flow

Crawling and data storage -> Text processing (removing stop words) -> One-hot vector encoding of the dataset -> Neural network module -> Training the network -> Resulting embeddings -> Statistics and visualization
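The text processing and one-hot encoding steps of the flow above can be sketched as follows. This is a minimal sketch under stated assumptions, not the repository's actual implementation: the class `OneHotSketch`, its method names, the stop-word list, and the sample tokens are all hypothetical, and the input is assumed to be already tokenized.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration, not code from this repository.
final class OneHotSketch {

    // Drops tokens found in the stop-word set, keeping the original order.
    static List<String> removeStopWords(List<String> tokens, Set<String> stopWords) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (!stopWords.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    // Assigns each distinct token an index in order of first appearance.
    static Map<String, Integer> buildVocabulary(List<String> tokens) {
        Map<String, Integer> vocab = new LinkedHashMap<>();
        for (String token : tokens) {
            vocab.putIfAbsent(token, vocab.size());
        }
        return vocab;
    }

    // Returns a vector of vocabulary length with a single 1.0 at the token's index.
    static double[] oneHot(String token, Map<String, Integer> vocab) {
        Integer index = vocab.get(token);
        if (index == null) {
            throw new IllegalArgumentException("Unknown token: " + token);
        }
        double[] vector = new double[vocab.size()];
        vector[index] = 1.0;
        return vector;
    }

    public static void main(String[] args) {
        // Assumed tokenization of a sample sentence; the stop-word list
        // below is illustrative only.
        List<String> tokens = Arrays.asList("이", "영화", "는", "정말", "재미있다");
        Set<String> stopWords = new HashSet<>(Arrays.asList("이", "는"));

        List<String> filtered = removeStopWords(tokens, stopWords);
        Map<String, Integer> vocab = buildVocabulary(filtered);

        System.out.println(filtered);
        System.out.println(Arrays.toString(oneHot("정말", vocab)));
    }
}
```

A one-hot vector grows with the vocabulary size, so for a large crawled corpus it is common to store only the token index and treat the vector as implicit; the dense array here is kept for clarity.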
