Taeyoon.Kim.DS

The Transformer architecture

https://www.youtube.com/watch?v=H39Z_720T5s&list=PLo2EIpI_JMQvWfQndUesu0nPBAtZ9gP1o&index=5 Encoders, decoders, encoder-decoders. An encoder converts text into numerical representations. Combining an encoder with a decoder gives an encoder-decoder, or sequence-to-sequence (seq2seq), transformer. https://www.youtube.com/watch?v=MUqNwgPjJvQ&list=PLo2EIpI_JMQvWfQndUesu0nPBAtZ9gP1o&index=6 BERT is a popular encoder. Welcome t..

Hugging Face Course 2023. 9. 13. 19:14

What is Transfer Learning

https://www.youtube.com/watch?v=BqqfQnyjmgg&list=PLo2EIpI_JMQvWfQndUesu0nPBAtZ9gP1o&index=4 Training from scratch --> Fine-tuning a pretrained model. ImageNet is commonly used as a dataset for pretraining models in computer vision (CV). GPT-2 was pretrained on 40 GB of internet text posted by users on Reddit. BERT was pretrained on the content of English Wikipedia and 11,000 unpublished books. ImageNet content..

Hugging Face Course 2023. 9. 13. 18:43

The pipeline function

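https://www.youtube.com/watch?v=tiZFewofSLM&list=PLo2EIpI_JMQvWfQndUesu0nPBAtZ9gP1o&index=2 from transformers import pipeline. Create a pipeline for one task at a time, e.g. classifier = pipeline("sentiment-analysis"); other task names include "zero-shot-classification" and "text-generation". Call it as classifier("Text"); for zero-shot classification, also pass candidate_labels=["A", "B", "C"]. --> The returned scores are softmax-type probabilities.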
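
A minimal sketch of the two ideas in this note: calling a zero-shot pipeline with candidate_labels, and what "softmax type" scores look like. The helper name zero_shot and the raw scores below are made up for illustration; the softmax function is a plain-Python stand-in for the normalization, not the library's own code. zero_shot is defined but not called here, because running it needs the transformers package and downloads a default model.

```python
import math


def zero_shot(text, labels):
    """Hypothetical helper: classify text against labels with a zero-shot pipeline."""
    # Imported lazily: requires the transformers package and a model download.
    from transformers import pipeline

    classifier = pipeline("zero-shot-classification")
    return classifier(text, candidate_labels=labels)


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    shift = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - shift) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical raw scores for the candidate labels (not real model output).
candidate_labels = ["A", "B", "C"]
raw_scores = [2.0, 1.0, 0.1]

scores = softmax(raw_scores)
ranked = sorted(zip(candidate_labels, scores), key=lambda pair: pair[1], reverse=True)
for label, score in ranked:
    print(f"{label}: {score:.3f}")
```

Because the scores are normalized together, raising one label's probability necessarily lowers the others.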

Hugging Face Course 2023. 9. 13. 18:37

Creating and using a PyTorch container

Pull the image: docker pull pytorch/pytorch:2.0.1-cuda11.7-cudnn8-devel Start the container: docker run -it --gpus all -p 8888:8888 04c0663041d9 Launch Jupyter Lab: jupyter lab --ip=0.0.0.0 --port=8888 --allow-root In a new Jupyter notebook, run: !apt-get update && apt-get install -y wget unzip vim git In the terminal, run: git clone https://github.com/ewankim1023/yolov5_custom_submodule.git git config --global user.email tae..

Data Science 2023. 8. 31. 21:47
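
The container steps above can be sketched as a small Python helper that only assembles and prints the commands; it does not run Docker. The function name build_commands is hypothetical. One assumption differs from the note: the sketch passes the image tag to docker run instead of the local image ID 04c0663041d9, since that ID is specific to one machine.

```python
import shlex

IMAGE = "pytorch/pytorch:2.0.1-cuda11.7-cudnn8-devel"
PORT = 8888


def build_commands(image=IMAGE, port=PORT):
    """Return the argument lists for each step, keyed by step name."""
    # Download the image from the registry.
    pull = ["docker", "pull", image]
    # Start an interactive container with all GPUs and the Jupyter port published.
    run = ["docker", "run", "-it", "--gpus", "all", "-p", f"{port}:{port}", image]
    # Run inside the container: serve Jupyter Lab on all interfaces as root.
    jupyter = ["jupyter", "lab", "--ip=0.0.0.0", f"--port={port}", "--allow-root"]
    return {"pull": pull, "run": run, "jupyter": jupyter}


commands = build_commands()
for step, argv in commands.items():
    print(f"{step}: {shlex.join(argv)}")
```

Keeping the port in one variable keeps the -p mapping and the --port flag in agreement, which is what lets the host browser reach Jupyter Lab.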

Scaling human feedback

https://www.coursera.org/learn/generative-ai-with-llms/lecture/eJVnL/scaling-human-feedback While reward models can replace human evaluation in RLHF fine-tuning, creating the initial labeled dat..

Generative AI with Large Language Models 2023. 8. 28. 21:59

RLHF: Reward hacking

https://www.coursera.org/learn/generative-ai-with-llms/lecture/eJVnL/scaling-human-feedback Reinforcement Learning from Human Feedback (RLHF) aligns LLMs with human preferences using a reward mo..

Generative AI with Large Language Models 2023. 8. 28. 21:43

RLHF: Fine-tuning with reinforcement learning

https://www.coursera.org/learn/generative-ai-with-llms/lecture/sAKto/rlhf-fine-tuning-with-reinforcement-learning (Coursera, Week 3: Reinforcement learning and LLM-powered applications. Video created by deeplearning.ai and Amazon Web Services for the course "Generative AI with Large Language Models".) To align the instruction-fine-tuned ..

Generative AI with Large Language Models 2023. 8. 28. 21:33

RLHF: Reward model

https://www.coursera.org/learn/generative-ai-with-llms/lecture/Wf1jL/rlhf-reward-model At this stage, all the necessary components are in place to train the reward model. Althou..

Generative AI with Large Language Models 2023. 8. 28. 21:27
