Tags
- Reinforcement Learning for Human Feedback
- ai agents
- community
- dataiku
- deployments
- gemma
- generative ai
- gpt4
- kiji
- llm
- machine learning
- mentorship
- mlops
- model deployment
- model profiling
- open source
- openai
- pii
- privacy
- rlhf
- serving
- software engineering
- speculative decoding
- startups
- tensorflow
- tensorflow extended
- vibe coding
- vllm
Reinforcement Learning for Human Feedback
ai agents
community
dataiku
deployments
gemma
generative ai
gpt4
kiji
llm
machine learning
- Highly interesting Twitter threads to revisit from time to time
- Receiving Google Open Source Peer Bonus Award 2022
- Notes on GPT4
- Notes on Model Performance Profiling
- Notes on Reinforcement Learning for Human Feedback
- Notes on deploying models with TFServing
- How to Profile TensorFlow Serving Inference Requests with TFProfiler
- Speculative Decoding with vLLM
- Deploying Google's Gemma on Vertex AI
- Speculative Decoding with vLLM using Gemma
- Kiji Privacy Proxy™ - Protecting Your Data in the Age of Generative AI