๐ ๐ผ๐๐ ๐๐ฒ๐ฎ๐บ๐ ๐ณ๐ผ๐ฐ๐๐ ๐ผ๐ป “๐๐๐ถ๐ป๐ด ๐๐”
However, my interest lies in whether ๐๐ ๐ถ๐ ๐๐ฟ๐๐น๐ ๐ฑ๐ฒ๐น๐ถ๐๐ฒ๐ฟ๐ถ๐ป๐ด ๐ฏ๐๐๐ถ๐ป๐ฒ๐๐ ๐ผ๐๐๐ฐ๐ผ๐บ๐ฒ๐.
Utilising AI tools is straightforward, but delivering real value with AI is where it becomes intriguing.
๐๐ฒ๐ฎ๐ฟ๐ป๐ถ๐ป๐ด & ๐ฃ๐ฟ๐ฎ๐ฐ๐๐ถ๐ฐ๐ฒ #๐ญ
When engineers adopt tools like GitHub Copilot or Cursor, they primarily change how they write code. Yet, when an organisation embeds AI into its core business architecture, a deeper transformation occurs:
๐ Traditional engineering practices begin to break down.
This is where an extended skillset becomes essentialโregardless of the terminology we use.
๐๐ฒ๐ฎ๐ฟ๐ป๐ถ๐ป๐ด & ๐ฃ๐ฟ๐ฎ๐ฐ๐๐ถ๐ฐ๐ฒ #๐ฎ
The real shift is from:
๐๐ฆ๐ต๐ฆ๐ณ๐ฎ๐ช๐ฏ๐ช๐ด๐ต๐ช๐ค ๐ด๐บ๐ด๐ต๐ฆ๐ฎ๐ด โ ๐ช๐ง ๐, ๐ต๐ฉ๐ฆ๐ฏ ๐
to
๐๐ณ๐ฐ๐ฃ๐ข๐ฃ๐ช๐ญ๐ช๐ด๐ต๐ช๐ค ๐ด๐บ๐ด๐ต๐ฆ๐ฎ๐ด โ ๐ช๐ง ๐, ๐ต๐ฉ๐ฆ๐ฏ ๐ฑ๐ณ๐ฐ๐ฃ๐ข๐ฃ๐ญ๐บ ๐ .
In discussions with senior tech leaders, one thing is evident:
Managing that uncertainty at scale is not merely a coding problem.
It evolves into:
- A systems design challenge
- A testing and validation challenge
- A cultural shift in our perception of quality.
๐๐ ๐ฎ๐บ๐ฝ๐น๐ฒ: ๐ง๐ฒ๐๐๐ถ๐ป๐ด ๐ฃ๐ฟ๐ผ๐ฏ๐ฎ๐ฏ๐ถ๐น๐ถ๐๐๐ถ๐ฐ ๐ฆ๐๐๐๐ฒ๐บ๐
One area of exploration is how testing must evolve.
We transition from binary correctness to evaluation-based systems (โevalsโ):
- Optimising for statistically significant improvement over time
- Accepting variability while maintaining quality control.
In practice:
- Using LLM-as-a-Judge patterns to assess outputs (factuality, helpfulness, tone)
- Running multiple models in parallel to validate outputs
- Maintaining golden datasets and conducting regression-style evals on every change.
Where I currently stand
Across most areas of the AI SDLC, a hybrid model appears most effective:
- Deterministic where precision and control are vital
- Probabilistic where flexibility and adaptability add value.
The challenge lies not in choosing one approach but in designing systems that effectively balance both.
This remains a working hypothesis that I am actively refining. However, the direction is becoming clearer:
๐ For AI-native delivery (beyond just AI-assisted coding), we must rethink how we design systems, define quality, and evolve engineering practices.
Discussion
Iโm curious about how others are navigating this:
๐๐ณ๐ฆ ๐บ๐ฐ๐ถ ๐ญ๐ฆ๐ข๐ฏ๐ช๐ฏ๐จ ๐ต๐ฐ๐ธ๐ข๐ณ๐ฅ๐ด ๐ฅ๐ฆ๐ต๐ฆ๐ณ๐ฎ๐ช๐ฏ๐ช๐ด๐ต๐ช๐ค, ๐ฑ๐ณ๐ฐ๐ฃ๐ข๐ฃ๐ช๐ญ๐ช๐ด๐ต๐ช๐ค, ๐ฐ๐ณ ๐ฅ๐ฆ๐ด๐ช๐จ๐ฏ๐ช๐ฏ๐จ ๐ง๐ฐ๐ณ ๐ข ๐ฉ๐บ๐ฃ๐ณ๐ช๐ฅ ๐ฎ๐ฐ๐ฅ๐ฆ๐ญ?
๐ฌ๐๐ ๐ฎ๐ฆโ๐’๐ฎ ๐ฆ๐ข๐จ๐ฆ๐ณ ๐ต๐ฐ ๐ค๐ฐ๐ฎ๐ฑ๐ข๐ณ๐ฆ ๐ฏ๐ฐ๐ต๐ฆ๐ด ๐ธ๐ช๐ต๐ฉ ๐ฐ๐ต๐ฉ๐ฆ๐ณ๐ด ๐ต๐ข๐ค๐ฌ๐ญ๐ช๐ฏ๐จ ๐ต๐ฉ๐ช๐ด ๐ด๐ฉ๐ช๐ง๐ต.
Leave a comment