What are you hosting? Can you say what the application is?
Discussion
I've experimented with smaller models, such as 7 billion and 13 billion parameters. When comparing Falcon (13 billion parameters) to Llama (13 billion parameters), Falcon clearly outperforms it.
However, caution is necessary since we are still in the early stages of development, with much ongoing progress.
You can run it yourself: just open the Colab notebook and click the play button, and it will work. You can also attach your own storage to train it. It has all the models.
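As a rough sketch of what attaching storage in a Colab notebook could look like, assuming the storage in question is Google Drive (the speaker does not say which storage is meant): `google.colab.drive.mount` is the standard Colab call, while the helper name `attach_storage`, the `checkpoints` fallback folder, and the `finetune-run` directory are hypothetical names chosen for illustration.

```python
from pathlib import Path
from typing import Optional


def attach_storage(mount_point: str = "/content/drive") -> Optional[Path]:
    """Mount Google Drive when running inside Colab; return None elsewhere."""
    try:
        # This module is only importable inside a Colab runtime.
        from google.colab import drive
    except ImportError:
        return None
    drive.mount(mount_point)  # prompts for authorization in the notebook
    return Path(mount_point) / "MyDrive"


# Assumption: training outputs go to Drive when available,
# otherwise to a local folder (hypothetical fallback).
checkpoint_root = attach_storage()
if checkpoint_root is None:
    checkpoint_root = Path("checkpoints")

output_dir = checkpoint_root / "finetune-run"  # hypothetical run name
print(f"Training outputs would be written under: {output_dir}")
```

Anything written under the mounted folder persists after the Colab session ends, which is the main reason to attach storage before a long training run.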