Petals

What is Petals?

The Petal tool is a decentralized platform that runs a large language model like Bloom-176b. It is capable of loading small parts of the model to run inference and fine-tuning. Single-batch inference takes approximately 1 second per step (token) and can run parallel inference up to hundreds of tokens/sec. This tool offers more than just a classic language model API, using fine-tune sampling methods and allowing users to execute custom paths and see hidden states. Petal also offers flexible PyTorch API. It is part of the BigScience research workshop project.

Pricing:
Categories:

KEY FEATURES

  • ✔️ Decentralized platform.
  • ✔️ Large language model like bloom-176b.
  • ✔️ Single-batch inference.
  • ✔️ Fine-tuning.
  • ✔️ Flexible pytorch api.

USE CASES

  1. Natural language processing.
  2. Text generation.
  3. Sentiment analysis.
No reviews yet
Write a review
Name*
Email
Enter your comment*