Real Work

clone from Jerry's github (LFS301 & common)

munakata@PGMIT:~/program$ git clone https://github.com/lftraining/LFS301
Cloning into 'LFS301'...
Username for 'https://github.com': hmunak
Password for 'https://hmunak@github.com':
remote: Enumerating objects: 11955, done.
remote: Counting objects: 100% (516/516), done.
remote: Compressing objects: 100% (371/371), done.
remote: Total 11955 (delta 264), reused 296 (delta 144), pack-reused 11439
Receiving objects: 100% (11955/11955), 13.00 MiB | 4.50 MiB/s, done.
Resolving deltas: 100% (8949/8949), done.

munakata@PGMIT:~/program$ git clone https://github.com/lftraining/common
Cloning into 'common'...
Username for 'https://github.com': hmunak
Password for 'https://hmunak@github.com':
remote: Enumerating objects: 2333, done.
remote: Counting objects: 100% (198/198), done.
remote: Compressing objects: 100% (124/124), done.
remote: Total 2333 (delta 75), reused 172 (delta 58), pack-reused 2135
Receiving objects: 100% (2333/2333), 3.22 MiB | 2.32 MiB/s, done.
Resolving deltas: 100% (1238/1238), done.

munakata@PGMIT:~/program$ ls -al
total 16
drwxr-xr-x  4 munakata munakata 4096 Jul 22 07:29 .
drwxr-xr-x  8 munakata munakata 4096 Jul 22 07:23 ..
drwxr-xr-x  6 munakata munakata 4096 Jul 22 07:28 LFS301
drwxr-xr-x 10 munakata munakata 4096 Jul 22 07:29 common

create work directory (LFS301-JP)

Ubuntu packages install

munakata@PGMIT:~/program/LFS301$ sudo apt update
munakata@PGMIT:~/program/LFS301$ sudo apt upgrade
munakata@PGMIT:~/program/LFS301$ sudo apt install texlive-full
munakata@PGMIT:~$ sudo apt install python3-pip
munakata@PGMIT:~/program/LFS301$ sudo apt install python3-pygments

python3 & pip alias setting (added to .bashrc)

alias python="python3" 
alias pip="pip3" 

Try to generate LFS301.pdf

Try to generate LFS301-JP.pdf

Reference Files (pulled from following sepatae repos)

Git repos

Google Drive

日本語ファイル(e-learning 用に翻訳したもの)

ソースファイル

行数確認

英語ファイル

処理プロトコル案1・・・失敗(detex の精度が低くテキストの再現性が低い)

処理プロトコル案2・・・pdf からテキストをインデックス化し tex ファイルに反映

plagiarism detection python (盗用・剽窃チェック) アルゴリズムの検討

単純な文章の類似度判定

pdftotext と detex で特殊文字が違ったエンコードになった(些細だが、ハマる)

'What is "Cloud Native" and how it works?' <----- detex (from Tex)
'What is ”Cloud Native” and how it works?\n'  <---- pdftotext (from PDF)

トップ   新規 一覧 検索 最終更新   ヘルプ   最終更新のRSS