William Skertic’s Post

Engineering leader - MLOps - Data Science - Management

1mo

Here's a concise representation of your ML platform, courtesy of Laszlo Sragner.

1mo

I wrote over the weekend that any notebook can be rewritten in a day to be production-ready (this went down as well as you'd expect); I also had an article about a DS using Angular and Terraform and why this is a bad idea. Lastly, my comment about the "Market for Lemons" in ML consultancy quality was so successful that Maria Vechtomova turned it into a standalone article. Let's tie all this together with a review of DS skills and my favourite psychological concept, "Self-Determination Theory".
The idea is that to stay motivated to do something, you need three things present at the same time: Mastery, Autonomy and Relatedness. For a DS, this translates into: use your skills frequently enough to get better at them (Mastery), solve problems the way you want to (Autonomy), and work with others on long-term projects that you can feel are yours and be proud of (Relatedness).
How does this relate to DSes doing Terraform? The DS/ML/AI space is incredibly complex, so much so that participants suffer from "Bounded Rationality" (BR): no matter how hard you try, you won't be able to comprehend it in its entirety. No surprise, given that the top of the stack (most abstract) is essentially business analysis and business strategy (cue BI and "data stories"), and the bottom of the stack (most concrete) is pretty much an inch above the hardware (Kubernetes, low-level YAML hacking and Terraform). How do you expect the same person to handle all of this at the same time?
If you are fighting BR, you need to forget something in order to remember the details of something else: "shuffle it out of the cache", in computer speak. This instantly hurts Mastery, because you are context-switching all the time. The other failure mode is being responsible for rarely used technologies, which again works against Mastery. This is why doing Terraform on top of your BI and ML/AI (and some DE) work is a bad idea. You are just making your BR situation worse.
Low Mastery also hurts Autonomy: you feel unable to do what you want because you are not good enough, but you can't get better because you constantly need to switch away from these skills, or you use them so rarely that you never get a chance to improve. This was recognised at the beginning of the MLOps era, and MLOps engineers were born to help out DSes. This was a good idea, but now the homogeneous DS cohort is split into two different mindsets, and you still need to allocate each skill to one of the two roles. The decision was to make the DSes stick to high-level business issues and the MLOps engineers to low-level technical ones, which immediately runs into the problem of how to coordinate between the two.
There are two main concepts: Assembly Line and Abstraction. In "Assembly Line", DSes write POCs in notebooks and "throw them over the wall" to MLEs, who essentially re-implement them, rediscovering all the decisions the DSes made during the POC phase. [continues on my blog, don't forget to subscribe]


More Relevant Posts

Glen Wright Colopy, DPhil

📊 Head of Data Science & Statistics at Wildfell Software 🛠️ Data Science as a Service for Startups
1mo
If you're (i) a data scientist who's heavily on the business case or domain expert side of #datascience, and (ii) perennially feeling cut off or disconnected from the deployment of your work, then this hugely useful post & diagram by Laszlo Sragner may help clarify things. Not surprising that this is coming from Laszlo: he's not just one of the funniest DS/ML personalities on LinkedIn, it's also obvious to anyone who reads his posts that he spends FAR MORE time thinking than posting (a rare quality!)... and here's a perfect example. I'm glad that Laszlo Sragner incorporates how people think & learn into his reasoning on how to organize technical work. This assembly line (horizontal separation) model of DS-MLOps collaboration really describes the common pain points we've been seeing as DS irons itself out in practice. I'm looking forward to reading more details on his blog about the abstraction (vertical separation) model. What do you think, Todd Deshane & Sam Moreland?
Nathan "CAP'N" COOK

SFTE Fellow | Principal Flight Test Engineer @ Hermeus
1mo
My first impression is that this framing also applies to operators (who need the system-wide big picture) and discipline engineers (who need to focus on implementation details)...
Michael Vandi

CEO at Addy AI • Ex AWS Software Engineer • Carnegie Mellon Alum
8mo
⚡LangDrive update: Auto-configure Python environment for LLM training 🛠️ Train 100+ LLMs on your private data directly from your CLI. 🔥 The new version of LangDrive (https://docs.langdrive.ai) comes with a shell script that runs the commands needed to configure a Python environment to run the training image. Clone LangDrive on your machine and run: cd src/train && ./run.sh

Welcome to Langdrive's Documentation Portal

docs.langdrive.ai
Aaron Bach

CTO @ Liminal
9mo
I'll admit that when GitHub first released Copilot, I was skeptical that it was worth the time. It took me a while to realize why: I had an irrational fear that Copilot's advancement would somehow invalidate the zillions of hours we, the technology community, have put into honing our skills and craft. I didn't like the idea that a machine could "do my work for me."
One day, that mentality changed, and I installed it. Over five to six weeks, I saw my own development capabilities skyrocket. At the time, I was building a product from the ground up and shaking off kilometers of Terraform rust. Although my muscle memory was geared toward StackOverflow'ing every question, I slowly started to lean on Copilot. In short order, I had recaptured the magic of IaC (most importantly, I got our product delivered on time).
I leaned on Copilot more and more, both professionally and in my open-source work. When I needed suggestions on an algorithm, Copilot offered them; when I was not too fond of writing another unit test, it gave me one for free; when I couldn't remember a particular TypeScript syntax because I lived in Python, it came through. I had begun to swing from Copilot doubter to believer.
Co-founding Liminal gave me a necessary pause. In the proper context, Copilot and all LLMs are valuable tools, but if I don't think about the security of what they produce, I run the risk of unthinkingly following advice and turning from "Copilot-augmented AaronDev" into "Copilot-AaronDrone." A case in point is a recent arXiv paper on the security weaknesses of Copilot-generated code. TL;DR: 35.8% of the Copilot-generated code snippets studied contain security weaknesses; scarier still, of the 42 CWE types found, 11 are in the 2022 CWE Top-25.
This outcome is a lesson for me and everyone: wherever generative AI takes us, we must remain in command. The very best of this generational leap will come when we align what it offers with our own intuition. Stay frosty, dev friends. #LLM #GenAI #Security

Security Weaknesses of Copilot Generated Code in GitHub

arxiv.org

Martin Araya

Developing projects on the web with Cloud and AI | Azure & GCP | OpenAI & Mistral | I write 2 to 3 post per day
2mo
✨ Martin's Test Questions: Rust Part 1 ✨
1️⃣ How would you implement a function to find the maximum of a list of numbers in Rust?
➡ In Rust, you can call the max() method on the list's iterator to find the maximum value. It returns None if the list is empty and Some with the maximum value if it contains elements. (This works for integer types; floats are not `Ord`, so they need a different approach.)
2️⃣ Describe a scenario where you would use a for loop instead of a while loop in Rust.
➡ I would use a for loop when I need to iterate over a collection of items or a range of values, where the number of iterations is known beforehand or is defined by the structure being iterated over, such as a vector or a range of numbers.
3️⃣ Explain how you can use pattern matching to simplify code that handles multiple options.
➡ Pattern matching in Rust allows for more elegant and safe handling of the different values a variable can take, and is especially useful with enums. For example, instead of chaining several if checks or unwrap calls to unpack a Result or Option, a single match addresses each case directly and clearly, handling both expected values and errors or missing values explicitly in the same block of code.
4️⃣ How would you implement a recursive function in Rust? What precautions would you take?
➡ To implement a recursive function in Rust, I would simply define a function that calls itself within its definition. Important precautions include making sure there is a clear stop condition to prevent infinite recursion, and keeping an eye on call-stack usage. Each thread's stack has a fixed size, so very deep recursion can lead to a stack overflow. Rust does not guarantee tail-call optimization, so writing the function in tail-recursive form is not enough; where deep recursion is necessary, I would rewrite it as a loop or keep an explicit stack on the heap.
5️⃣ How can you improve the efficiency of a function that performs repetitive operations on a large collection of data?
➡ For functions processing large volumes of data, using iterators and methods such as map, filter, and fold can increase efficiency by avoiding intermediate collections. Using parallelism with libraries like Rayon to leverage multiple processor cores can also be beneficial.
❗ Follow me on LinkedIn to continue reading more posts like this one. It is one more in a series of posts, articles, and guides on project technologies, payment gateway technologies, AI, DevOps, and Cloud solutions (Azure, AWS, and Oracle). #RustLang #Programming #CodeOptimization #SoftwareDevelopment #TechTips #RustProgramming #FunctionalProgramming #SystemProgramming #Concurrency #ParallelComputing #CodeEfficiency #SoftwareEngineering #DeveloperTools #TechCommunity #Rustaceans
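The answers to questions 1, 3, 4 and 5 above can be sketched in a few lines of Rust. This is a minimal illustration rather than the post author's own code: the function names (`largest`, `describe`, `factorial`, `factorial_iter`, `sum_of_even_squares`) are hypothetical and chosen only for this example. Note that Rust does not guarantee tail-call elimination, so the iterative `factorial_iter` is the safer form when the input can be large.

```rust
/// Q1: maximum of a slice of integers; `None` when the slice is empty.
pub fn largest(values: &[i32]) -> Option<i32> {
    values.iter().copied().max()
}

/// Q3: a single `match` covers every case of a `Result`,
/// including a guarded sub-case for negative numbers.
pub fn describe(parsed: Result<i32, String>) -> String {
    match parsed {
        Ok(n) if n < 0 => format!("negative: {n}"),
        Ok(n) => format!("value: {n}"),
        Err(msg) => format!("error: {msg}"),
    }
}

/// Q4: plain recursion with an explicit base case (`n <= 1`).
/// Each call uses a stack frame, so very large `n` risks overflow.
pub fn factorial(n: u64) -> u64 {
    if n <= 1 {
        1
    } else {
        n * factorial(n - 1)
    }
}

/// Q4, alternative: the same computation as a loop-like iterator,
/// which uses constant stack space regardless of `n`.
pub fn factorial_iter(n: u64) -> u64 {
    (1..=n).product()
}

/// Q5: a lazy iterator chain; no intermediate `Vec` is allocated
/// between the `filter`, `map`, and `sum` steps.
pub fn sum_of_even_squares(values: &[i64]) -> i64 {
    values
        .iter()
        .filter(|&&v| v % 2 == 0)
        .map(|&v| v * v)
        .sum()
}
```

Calling `largest(&[3, 9, 4])` gives `Some(9)` and `largest(&[])` gives `None`, which is the empty-list behaviour the first answer describes.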
Devaraj Gowdanar

Co-Founder & CTO at Ascend DeFi Labs | Co-Founder & CTO at OXIVIVE LIFE CARE PVT LTD | Chairman at GTPL | Entrepreneur | Pre-seed & Seed stage Investor | Board Member | Blockchain enthusiast|Tokenomics Consulting.
1w
Top 11 Factors to Consider When Designing Cloud-Native Applications in Python🐍🌩️
Building cloud-native applications can revolutionize how businesses operate, offering scalability, flexibility, and resilience. As Python continues to dominate the development landscape, it's crucial to consider certain factors to ensure your cloud-native applications are robust and efficient. Here are 11 key factors to keep in mind:
1. Microservices Architecture: Break down your application into independent, loosely coupled microservices to enhance scalability and maintainability.
2. Containerization: Use Docker to containerize your Python applications and Kubernetes to orchestrate them, for consistent development, testing, and deployment environments.
3. API Gateway: Implement an API gateway to manage communication between microservices, handle auth, rate limiting, and more.
4. Serverless Computing: Leverage serverless platforms (e.g., AWS Lambda) for specific functions to reduce overhead and improve scalability.
5. Scalability: Design for horizontal scalability, using tools like load balancers and auto-scaling groups.
6. Observability: Incorporate logging, monitoring, and tracing (e.g., Prometheus, Grafana) from the start for greater insight into application performance.
7. Security: Ensure robust security practices such as implementing firewalls, encrypting data, and adhering to compliance standards.
8. CI/CD Pipelines: Adopt Continuous Integration and Continuous Deployment practices to automate testing and deployment, ensuring faster and safer releases.
9. Fault Tolerance: Design your application to gracefully handle failures and ensure that services can recover quickly without significant downtime.
10. Data Management: Use cloud-native databases and storage solutions that offer scalability, durability, and easy backup and recovery options.
11. DevOps Culture: Foster a DevOps culture to enhance collaboration between development and operations teams, ensuring a smoother workflow and quicker problem-solving.
Let’s leverage the power of Python and the cloud to build the future! 🌐💻 #CloudNative #Python #Microservices #DevOps #Containerization #Serverless #Scalability #Observability #Security #CICD #FaultTolerance #DataManagement
AI‐TechPark

15,692 followers
5mo
Aqua Security wins 2023 Global SSCS Technology Innovation Leadership. Learn more: https://lnkd.in/dm96gvpD #cloudsecurity #artificialintelligence #aitechparknews #aitechnology #technology #innovation #aitech #machinelearning #python #deeplearning

Aqua Security wins 2023 Global SSCS Technology Innovation Leadership

https://ai-techpark.com
Roman Glushach

Principal Software Engineer | 15+ years of experience in industry | Full-Stack Developer (Java, NodeJS, React) | Building solutions at scale
12mo
💻🐳 Docker images are composed of layers, which are snapshots of changes made to the file system. Each layer is immutable, meaning it cannot be modified once it is created. When a container is launched from an image, #docker adds a writable layer on top of the image layers, where any changes made by the #container are stored. This way, multiple containers can share the same image layers while each has its own writable layer.
You can build an image by writing a `Dockerfile` that specifies the build instructions, or pull an existing image from a registry such as Docker Hub. Docker Hub is a public repository of images that anyone can use or #contribute to. There are also private registries that can be used to store and distribute images within an #organization or a team.
Docker images are essential for developing, testing, and deploying applications using containers. They provide a consistent and portable way of packaging and running #applications across different environments. By using Docker images, developers can focus on writing code rather than worrying about infrastructure and dependencies. #kubernetes
Take a look at the article to learn more about Docker images: https://lnkd.in/gfw-ixiN

Docker Images: A Deep Dive into Container Technology

romanglushach.medium.com
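The layering idea in the Docker post above can be illustrated with a toy model in Rust. This is only a sketch of the concept, not how Docker stores images on disk: the `Layer`, `Image`, and `Container` types and their methods are hypothetical names invented for this example. It shows two things the post describes: image layers are shared and never modified, and each container's writes land in its own private top layer.

```rust
use std::collections::HashMap;
use std::rc::Rc;

/// One image layer in this toy model: file path -> file contents.
pub type Layer = HashMap<String, String>;

/// An image is an ordered stack of layers, lowest first.
/// Nothing in this model mutates a layer after the image is built.
pub struct Image {
    layers: Vec<Layer>,
}

impl Image {
    pub fn new(layers: Vec<Layer>) -> Self {
        Image { layers }
    }
}

/// A container shares the image (via `Rc`) and owns a private writable layer.
pub struct Container {
    image: Rc<Image>,
    writable: Layer,
}

impl Container {
    /// Launching does not copy the image layers; it only adds a reference.
    pub fn launch(image: &Rc<Image>) -> Self {
        Container {
            image: Rc::clone(image),
            writable: Layer::new(),
        }
    }

    /// Writes go to the container's own layer, never to the image.
    pub fn write(&mut self, path: &str, contents: &str) {
        self.writable.insert(path.to_string(), contents.to_string());
    }

    /// Reads check the writable layer first, then the image layers top-down.
    pub fn read(&self, path: &str) -> Option<&str> {
        self.writable
            .get(path)
            .or_else(|| self.image.layers.iter().rev().find_map(|l| l.get(path)))
            .map(String::as_str)
    }
}
```

If two containers are launched from the same `Rc<Image>` and one of them writes to a path, the other still reads the original contents from the shared image layers.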
Fasih Khatib

Senior Software Engineer @ Sense
4mo
I'd previously written about architecture as code, and my new post talks about setting up Sphinx documentation for a Flask microservice. The idea is to make it easy to generate documentation, and review it as a part of the PR review process. Although Python-specific, the steps outlined in the post can hopefully be applied to any framework or language of your choice. https://lnkd.in/dPNZqcGh

Setting up Sphinx Documentation

fasihkhatib.com