🤗
On boarding

boseop kim seopbo

  • kakao
@lovit
lovit / huggingface_tokenizers_usage.md
Created August 27, 2020 22:28
Hugging Face tokenizers usage
import tokenizers
tokenizers.__version__
@lovit
lovit / huggingface_konlpy.md
Last active November 20, 2024 18:00
huggingface + KoNLPy

Huggingface

  • Provides a range of NLP packages; three of them are particularly useful for training language models:

| package | note |
| --- | --- |
| transformers | Provides Transformer-based (masked) language model algorithms and pretrained models. |
| tokenizers | Provides functionality for training and using tokenizers that work with transformers. Distributed as a package separate from transformers. |
| nlp | Provides datasets and evaluation metrics. |
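As a quick check before working with the three packages named above, the following minimal sketch reports which of them are installed and at what version. It uses only the standard library (`importlib.metadata`). The helper name `installed_versions` is hypothetical and is not part of any Hugging Face package. Note that the `nlp` distribution was later renamed `datasets`, so it may be reported as missing on a current install.

```python
from importlib import metadata

# Distribution names as they appear in the package list above.
PACKAGES = ("transformers", "tokenizers", "nlp")


def installed_versions(names=PACKAGES):
    """Map each distribution name to its installed version, or None if absent.

    Hypothetical helper for illustration; it only reads installed metadata
    and never imports the packages themselves.
    """
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

Reading metadata rather than importing each package keeps the check fast and avoids the import-time side effects of heavy libraries.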
@danijar
danijar / blog_tensorflow_variable_sequence_classification.py
Last active December 31, 2021 10:04
TensorFlow Variable-Length Sequence Classification
# Working example for my blog post at:
# http://danijar.com/variable-sequence-lengths-in-tensorflow/
import functools
import sets
import tensorflow as tf
from tensorflow.models.rnn import rnn_cell
from tensorflow.models.rnn import rnn
def lazy_property(function):
@yrevar
yrevar / imagenet1000_clsidx_to_labels.txt
Last active November 21, 2025 06:36
text: imagenet 1000 class idx to human readable labels (Fox, E., & Guestrin, C. (n.d.). Coursera Machine Learning Specialization.)
{0: 'tench, Tinca tinca',
1: 'goldfish, Carassius auratus',
2: 'great white shark, white shark, man-eater, man-eating shark, Carcharodon carcharias',
3: 'tiger shark, Galeocerdo cuvieri',
4: 'hammerhead, hammerhead shark',
5: 'electric ray, crampfish, numbfish, torpedo',
6: 'stingray',
7: 'cock',
8: 'hen',
9: 'ostrich, Struthio camelus',