Guangyuan's Research and Development Blog: Building Machine Learning Apps with Hugging Face: LLMs to Diffusion Modeling

The seminar "Building Machine Learning Apps with Hugging Face: LLMs to Diffusion Modeling" from a few weeks ago from Hugging Face provides some interesting insights on LLMs to Diffusion Modeling.

History of MLOps

- Software is eating the world

- Deep learning is eating the world

- Transformers are eating the Deep learning

Underthehood, transfer learning is the key player for transformers eating the deep learning.

Transfer learning

There are many pre-trained models which have been trained on large-scale data that we cannot do by ourselves. But thankfully, we can use the output (the pre-trained models) for our tasks.

So nowadays, we can easily use those models, e.g., via Hugging Face, by following steps:

1. Identify the task matching the problem

- Text, Images, Speech

2. Pick a pre-trained SOTA

- no need to build, label, and clean a large dataset

- 2 lines of code to download and test a model

3. If needed, fine-tune it on your dataset

- build customized layers on top of pre-trained models for our needs

- only train the parameters of the customized layers

Mission of Hugging Face: Democratize Good ML

- open source, community, ethics-first

Some facts about Hugging Face

- Diffusers is growing faster than Transformers

- 150k free public models

- Over 50% of Hub models are private

- 1M model downloads/day

- How we make money? Sell compute services, expert support

- Get started with /tasks

- Bloomz from Bloom: an instruction fine-tuned open source LLM

- Fast ControlNet: Guide image generation

- Mac app for that (Diffusers)

- Free diffusion course from Hugging Face (git)

- Javascript client - huggingface.js

Building ML with Hugging Face

- Explore models

- Manage your models (every model is on github)

- Collaborate

- Use

from transformers import pipeline

- pipeline - encapsulize all - text in text out

AutoTrain

- allows to train based on your data (pricing)

Spaces

- allows you to create your app

Deploy models at scale

- headache about GPUs, CPUs, Docker containers etc.

- Inference Endpoints (we made models plug and play)

LLMs 3types

- Bloom, GPT

- Bloomz, GPT-3: intruction guided

- ChatGPT: Human feedback (RL-based)

Pinecone: vector database company

ModelScope from text to generate video

Building Machine Learning Apps with Hugging Face: LLMs to Diffusion Modeling

No comments:

Post a Comment