LangChain Like SK, LangChain is another open-source application development framework and toolkit (SDK) for building modern AI applications with LLMs. It provides out-of-the-box libraries and […]
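To make the framework concrete, here is a minimal sketch of composing a prompt template with a chat model in LangChain. It assumes the langchain-core and langchain-openai packages are installed and an OpenAI API key is set in the environment; the model name and example prompt are illustrative, not taken from the text.

```python
# A minimal LangChain sketch: a prompt template piped into a chat model.
# Assumes langchain-core and langchain-openai are installed and OPENAI_API_KEY
# is set; the model name below is illustrative.
from langchain_core.prompts import ChatPromptTemplate
from langchain_core.output_parsers import StrOutputParser
from langchain_openai import ChatOpenAI

prompt = ChatPromptTemplate.from_template(
    "Summarize the following support ticket in one sentence:\n\n{ticket}"
)
llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)

# Compose components into a chain with the | operator (LangChain Expression Language).
chain = prompt | llm | StrOutputParser()

print(chain.invoke({"ticket": "The app crashes whenever I upload a PNG over 5 MB."}))
```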
Understanding limits
AutoGPT – Developing and Operationalizing LLM-based Apps: Exploring Dev Frameworks and LLMOps
AutoGPT Another application that has received a lot of attention in the autonomous agent world is AutoGPT from Significant Gravitas. AutoGPT is an open-source application […]
Benefits of LLMOps – Developing and Operationalizing LLM-based Apps: Exploring Dev Frameworks and LLMOps
Benefits of LLMOps Comparing MLOps and LLMOps While it is evident that MLOps is to machine learning as LLMOps is to LLMs, LLMOps shares many […]
LLMOps best practices – Developing and Operationalizing LLM-based Apps: Exploring Dev Frameworks and LLMOps
LLMOps best practices As we wrap up this final section, we know that successfully navigating the generative AI and LLM landscape requires effective practices. As […]
Understanding TPM, RPM, and PTUs 2 – Deploying ChatGPT in the Cloud: Architecture Design and Scaling Strategies
RPM Beyond the TPM limit, an RPM rate limit is also enforced, where the RPM allotment available to a model is set in proportion to […]
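As a rough worked example of that proportional relationship, the sketch below estimates the RPM ceiling implied by a TPM quota. The ratio of 6 RPM per 1,000 TPM is the figure commonly cited in Azure OpenAI quota documentation for pay-as-you-go deployments and is an assumption here; verify the current ratio for your model and region.

```python
# Illustrative only: estimate the RPM allotment implied by a TPM quota,
# assuming the commonly documented Azure OpenAI ratio of ~6 RPM per 1,000 TPM.
RPM_PER_1K_TPM = 6  # assumed ratio; check current quota documentation

def estimated_rpm(tpm_quota: int) -> int:
    """Rough requests-per-minute ceiling for a given tokens-per-minute quota."""
    return (tpm_quota // 1000) * RPM_PER_1K_TPM

for tpm in (30_000, 120_000, 300_000):
    print(f"{tpm:>7} TPM -> ~{estimated_rpm(tpm)} RPM")
```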
Understanding TPM, RPM, and PTUs – Deploying ChatGPT in the Cloud: Architecture Design and Scaling Strategies
Understanding TPM, RPM, and PTUs As we scale, we will need to understand some additional terminology, such as tokens per minute (TPM), requests per minute […]
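To see how prompt size translates into a TPM budget, here is a small sketch that counts tokens with the tiktoken package. The package, the cl100k_base encoding, and the traffic figures are assumptions for illustration; confirm the correct encoding for the model you deploy.

```python
# Rough sketch: how prompt size and traffic translate into TPM consumption.
# Assumes the tiktoken package; cl100k_base is the encoding used by many recent
# OpenAI chat models, but verify the right encoding for your model.
import tiktoken

encoding = tiktoken.get_encoding("cl100k_base")

prompt = "Explain the difference between TPM, RPM, and PTUs in two sentences."
prompt_tokens = len(encoding.encode(prompt))

expected_completion_tokens = 120   # illustrative estimate
requests_per_minute = 50           # illustrative traffic level

tokens_per_minute = requests_per_minute * (prompt_tokens + expected_completion_tokens)
print(f"Prompt tokens: {prompt_tokens}")
print(f"Estimated TPM consumption: {tokens_per_minute}")
```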
Rate Limiting Policy in Azure API Management – Deploying ChatGPT in the Cloud: Architecture Design and Scaling Strategies
Rate Limiting Policy in Azure API Management Rate limiting in Azure API Management is a policy that restricts the number of requests a user can […]
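When API Management's rate-limiting policy rejects a call, the client receives an HTTP 429 response, so the application should back off and retry. The sketch below shows one way to handle that from Python; the gateway URL, deployment route, api-version, and environment variable name are placeholders, not values from the text.

```python
# Client-side sketch for calling an Azure OpenAI endpoint fronted by API
# Management: on HTTP 429 (rate limited), wait for the Retry-After period if
# provided, otherwise back off exponentially. URL and names are placeholders.
import os
import time
import requests

URL = ("https://my-apim-instance.azure-api.net/openai/deployments/"
       "my-deployment/chat/completions?api-version=2024-02-01")  # assumed route

def call_with_backoff(payload: dict, max_retries: int = 5) -> dict:
    headers = {"Ocp-Apim-Subscription-Key": os.environ["APIM_SUBSCRIPTION_KEY"]}
    for attempt in range(max_retries):
        response = requests.post(URL, json=payload, headers=headers, timeout=30)
        if response.status_code != 429:
            response.raise_for_status()
            return response.json()
        # Rate limited: honor Retry-After if present, otherwise back off exponentially.
        wait = int(response.headers.get("Retry-After", 2 ** attempt))
        time.sleep(wait)
    raise RuntimeError("Still rate limited after retries")
```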
Costs, training and support – Deploying ChatGPT in the Cloud: Architecture Design and Scaling Strategies
Costs, training and support To round off this chapter on deploying ChatGPT in the cloud with architecture design and scaling strategies, three additional areas are […]
Application Layer – Deploying ChatGPT in the Cloud: Architecture Design and Scaling Strategies
Application Layer Infrastructure Layer Note: We advise implementing a telemetry solution early to monitor your application’s token usage for prompts and completions. This allows for […]
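One lightweight way to start that telemetry is to log the token usage returned with every completion. The sketch below uses the openai Python SDK (v1+) against Azure OpenAI; the endpoint, deployment name, api-version, and logging sink are assumptions for illustration.

```python
# Minimal telemetry sketch: log prompt and completion token usage per call so
# consumption can be tracked from day one. Endpoint, deployment, and
# api-version are placeholders; in production, ship these logs to a telemetry
# sink such as Application Insights.
import logging
import os
from openai import AzureOpenAI

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("token-telemetry")

client = AzureOpenAI(
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-02-01",  # illustrative API version
)

def chat(messages: list[dict], deployment: str = "my-gpt-deployment") -> str:
    response = client.chat.completions.create(model=deployment, messages=messages)
    usage = response.usage
    logger.info(
        "prompt_tokens=%d completion_tokens=%d total_tokens=%d",
        usage.prompt_tokens, usage.completion_tokens, usage.total_tokens,
    )
    return response.choices[0].message.content
```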
Understanding and mitigating security risks in generative AI – Security and Privacy Considerations for Gen AI – Building Safe and Secure LLMs
Understanding and mitigating security risks in generative AI If you are a user of generative AI and NLP-based LLMs, such as ChatGPT, whether you are […]