Clone the repository
Deployment
Test call
First, create an API key here. Then run the following cURL commands:Fal endpoints expect the
Key prefix in the Authorization header.Documentation Index
Fetch the complete documentation index at: /llms.txt
Use this file to discover all available pages before exploring further.
🚀 New: LFM2.5-VL-450M — our smallest vision model is now available! Learn more →
Fal is a serverless generative media platform offering lightning-fast inference for AI models for image, video, and audio generation.
git clone https://github.com/Liquid4All/lfm-inference
cd fal
# run one-off server
fal run deploy-lfm2.py::serve
# run production server
fal deploy deploy-lfm2.py::serve --app-name lfm2-8b --auth private
export FAL_API_KEY=<your-fal-api-key>
# List deployed model
curl https://fal.run/<org-id>/<app-id>/v1/models -H "Authorization: Key $FAL_API_KEY"
# Query the deployed LFM model
curl -X POST https://fal.run/<org-id>/<app-id>/v1/chat/completions \
-H "Authorization: Key $FAL_API_KEY" \
-d '{
"model": "LiquidAI/LFM2-8B-A1B",
"messages": [
{
"role": "user",
"content": "What is the melting temperature of silver?"
}
],
"max_tokens": 32,
"temperature": 0
}'
Key prefix in the Authorization header.Was this page helpful?