Columbus

Groundhog Day, actually Year

Posted on 26 February 2026 by dselz

Here we go again. The local journalistic farce just got an extension. The article, in German, covers the 2026 edition of Groundhog Day.

Posted in Uncategorized | Comments Off

Elections 2026 – a short review and a promise

Posted on 20 December 2025 by dselz

I’m a member of the council of my village, Zollikon. Here is a short review and a promise for the next four years, ahead of the elections in March 2026 (German only).

Posted in Uncategorized | Comments Off

A little local journalistic farce

Posted on 15 March 2025 by dselz

A local news blog published an online slander for no reason. It’s all a farce. In German only.

Posted in Uncategorized | Comments Off

The State of Play in RAG and Enhanced RAG

Posted on 6 March 2025 by dselz

In the rapidly evolving landscape of artificial intelligence, Retrieval-Augmented Generation (RAG) and its enhanced versions are gaining significant traction. These technologies are pivotal in advancing the capabilities of generative AI by integrating retrieval mechanisms with generative models, thereby improving the accuracy and relevance of AI-generated content.

RAG: A Brief Overview

RAG combines the strengths of retrieval systems and generative models. It retrieves relevant documents or data points from a vast corpus and uses this information to generate more accurate and contextually relevant responses. This approach is particularly beneficial in scenarios where the generative model alone might lack the necessary context or factual accuracy.
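To make the retrieve-then-generate idea concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the tiny corpus, the word-overlap scoring (a stand-in for a real search engine or vector index), and the `build_prompt` helper. The call to an actual generative model is deliberately left out.

```python
import re
from collections import Counter

# Hypothetical mini corpus; a real setup would index millions of documents.
CORPUS = {
    "doc1": "The travel policy allows economy class flights for all staff.",
    "doc2": "Expense reports must be filed within 30 days of the trip.",
    "doc3": "The cafeteria serves lunch from 11:30 to 14:00.",
}

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def score(query, document):
    """Count overlapping tokens (toy stand-in for real relevance ranking)."""
    q, d = Counter(tokenize(query)), Counter(tokenize(document))
    return sum(min(q[t], d[t]) for t in q)

def retrieve(query, corpus, k=2):
    """Return the ids of the k best-scoring documents, dropping zero scores."""
    ranked = sorted(corpus, key=lambda doc_id: score(query, corpus[doc_id]), reverse=True)
    return [doc_id for doc_id in ranked[:k] if score(query, corpus[doc_id]) > 0]

def build_prompt(query, corpus, doc_ids):
    """Put the retrieved passages in front of the question for the generative model."""
    context = "\n".join(f"[{doc_id}] {corpus[doc_id]}" for doc_id in doc_ids)
    return f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {query}"

query = "When must expense reports be filed?"
top_docs = retrieve(query, CORPUS)
prompt = build_prompt(query, CORPUS, top_docs)
print(prompt)
```

The generative model only ever sees what `retrieve` hands over, which is why the quality of the retrieval step matters so much for the final answer.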

Enhanced RAG: The Next Step

Enhanced RAG takes this concept further by incorporating additional layers of intelligence and scalability. For instance, Squirro’s Enhanced RAG platform integrates a knowledge graph to improve retrieval accuracy and contextual understanding. This integration allows for a more nuanced understanding of the data, enabling the system to provide more precise and contextually aware responses.

Current Developments and Use Cases

Recent developments in Enhanced RAG focus on scalability and enterprise-level applications. Squirro has demonstrated the implementation of a permission-enabled enterprise RAG setup at the European Central Bank, showcasing the platform’s ability to handle complex, large-scale data environments. Additionally, the integration of out-of-the-box features and automation capabilities is redefining use cases, making Enhanced RAG a versatile tool for various industries.

Conclusion

The evolution from basic RAG to Enhanced RAG represents a significant leap in the capabilities of generative AI. By leveraging advanced retrieval techniques and integrating knowledge graphs, Enhanced RAG platforms are setting new standards for accuracy and contextual relevance in AI-generated content. As these technologies continue to develop, they promise to unlock new possibilities across different sectors, from finance to healthcare.

Posted in Uncategorized | Comments Off

AI is like WD40: it permeates everything, including productivity

Posted on 18 September 2024 by dselz

Reply to the article “KI beflügelt nicht primär die Produktivität, sondern mehr noch die Bürokratie” (“AI does not primarily boost productivity, but bureaucracy even more”)

In his article, Mathias Binswanger argues that artificial intelligence (AI) currently does not primarily increase productivity, but instead promotes bureaucracy. While there are legitimate concerns that AI applications could create new bureaucratic processes, this viewpoint neglects an important historical perspective and underestimates the transformative potential of AI.

What Grok* thinks about Squirro

Posted on 14 August 2024 by dselz

I asked X Grok (*Grok is Twitter’s, oh, X’s alternative to ChatGPT) about Squirro and if I should buy it (of course – I already did ;-). Here’s Grok’s response:

Ah, Squirro AI, the Swiss Army Knife of business intelligence, or as I like to call it, the “Squirro-nado” of data insights! Here’s why you might want to dive into the Squirro pool:

What is Squirro AI?

It’s an AI-powered business intelligence platform. Imagine if your company’s data could not only talk but also sing opera. That’s Squirro. It uses AI to analyze, understand, and visualize your data in ways that make your spreadsheets look like they’re from the stone age.
Natural Language Processing (NLP): You can ask Squirro questions like you’re chatting with a friend. “Hey Squirro, what’s up with our sales in Q3?” and it’ll give you the rundown without making you feel like you’re back in math class.
Data Integration: It’s like the ultimate party host for your data. It integrates data from all corners of your business, making sure everyone’s invited to the data party.
Predictive Analytics: Squirro doesn’t just tell you what happened; it’s like a fortune teller for your business, predicting trends and outcomes based on current data.

Why Should You Buy It?

Save Time, Save Money: Time is money, and Squirro helps you spend less of both. It automates data analysis, so your team can focus on strategy rather than number-crunching.
Better Decision Making: With insights that are more ‘Sherlock Holmes’ than ‘Scooby-Doo’, you’ll make decisions based on deep, AI-driven insights rather than gut feelings or guesswork.
Competitive Edge: While your competitors are still figuring out Excel, you’ll be using AI to predict market trends, customer behavior, and more. It’s like having a crystal ball, but one that’s actually useful.
Scalability: Whether you’re a startup or a multinational, Squirro grows with you. It’s like buying a pair of shoes that never wear out and always fit perfectly.
User-Friendly: Even if your tech skills are stuck in the ’90s, Squirro’s interface is designed to be intuitive. It’s like having a personal data assistant who’s also your best friend.
Security: With data breaches being the new ‘plague’, Squirro takes security seriously, ensuring your data is safer than a dragon’s treasure.
Customization: Tailor it to your business needs. It’s like having a suit made just for you, but instead of fabric, it’s data.
Innovation: By adopting Squirro, you’re not just buying a tool; you’re embracing innovation. It’s like being the first on your block to get a smartphone when everyone else is still using flip phones.

In essence, Squirro AI is like upgrading from a bicycle to a jetpack in the world of data analysis. If your business thrives on data-driven decisions (and let’s be honest, which one doesn’t?), Squirro could be the superhero cape you didn’t know you needed. Just remember, with great data comes great responsibility!

Posted in Uncategorized | Comments Off

Darwin at his best: the GenAI race

Posted on 25 July 2024 by dselz

Darwin is renowned for his theory of evolution and natural selection. Looking at the GenAI space, I get that same feeling. What looks easy at the start is rough in the details and very selective. Let me explain:

Scaling a GenAI solution to large-scale production requires meticulous alignment of various components. Each step is essential for a full production rollout, and without prior experience, each comes with a steep, costly, and time-consuming learning curve. Let’s examine these steps:

Data Ingestion at Scale: While a pilot may require only a few hundred or thousand documents to prove the concept, the challenge escalates when moving to production with millions of documents. Ingesting data at scale in environments with high data velocity demands robust infrastructure and excellent operational capabilities.
Data Enrichment: Often overlooked in small pilots, data enrichment is crucial for a successful RAG (Retrieval-Augmented Generation) setup. The inbound data must be well-enriched for the retrieval component to yield accurate results. For instance, managing versioned documents and ensuring the latest version is retrieved is a complex task.
RAG Search: Sounds simple, is complex. All too often, systems produce seemingly good but factually incorrect results. After all, search is probabilistic, and so are LLMs. Multiplied probabilities don’t yield better results. So you need to work hard on the retrieval part to get really good results, e.g. by factoring in better-enriched data to produce more accurate result sets before handing them to an LLM to produce a natural-language answer. Use the LLM for what it is good at (text comprehension and text generation), not for what it is not good at (search).
Maintaining a Vectorized Index: Building a vectorized index for a proof of concept is relatively straightforward. However, operating and continuously updating a large-scale index, often terabytes in size, is an entirely different challenge. And you need to master the art of updating a large index (terabytes of data) while querying that same index at the same time.
Security: Security is critical in any enterprise setup. Role-based access control, which dictates who can see what, is more complex in a GenAI setup. Ensuring access control for every vectorized chunk and updating the index with every role change is no easy feat.
Guardrails: All the above are required steps for a good RAG stack and yet not sufficient to guarantee consistently good, accurate and trustworthy results (people take results as is. Why? That’s for another blog post). It’s wise to integrate guardrails at prompt time (create better quality prompts) and at result time (comprehensive answer validation).
Integration with Existing Systems: Integrating the solution into an existing enterprise setup is another significant challenge. Simply adding another dashboard is not useful. The solution must interact with third-party systems, often in a bi-directional manner.
Testing: Any such complex beast requires a lot of continuous testing & monitoring of both the actual processing steps and also the result quality. Any subtle change of any of the components (say retrieval adjustments, say new LLM model) will have an impact on result output. You need to measure this continuously and refine your setup accordingly.
Operational Maintenance: Finally, operating the entire setup over time and ensuring high availability levels is essential.
Measurable ROI: A common oversight is the lack of a clear-cut business case detailing the cost-benefit ratio of any such rollout (the benefits are impressive if well done*). By mid-2025, CFOs will start asking tough questions: “Where’s the bang for my buck?”
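The point about multiplied probabilities in the RAG Search step can be made concrete with a little arithmetic. The sketch below assumes, purely for illustration, that retrieval and generation succeed independently and with made-up success rates; real pipelines are messier, but the direction of the effect is the same.

```python
def end_to_end_accuracy(p_retrieval, p_generation):
    """Probability that both steps succeed, assuming they are independent."""
    return p_retrieval * p_generation

# Illustrative numbers only, not measurements.
baseline = end_to_end_accuracy(0.80, 0.90)
better_llm = end_to_end_accuracy(0.80, 0.95)
better_retrieval = end_to_end_accuracy(0.95, 0.90)

print(f"baseline:         {baseline:.3f}")
print(f"better LLM:       {better_llm:.3f}")
print(f"better retrieval: {better_retrieval:.3f}")
```

Under these assumptions the chain is never better than its weakest link, so when retrieval is the weaker step, improving retrieval moves the end result more than swapping in a stronger LLM.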

Delivering reliable, accurate, and transparent AI at scale is not easy and is costly, yet if done well it will yield massive benefits. At Squirro we’ve been doing this for a while, and we happily share our experience. Have a look at our Knowledge Hub or simply get in touch!

* PS: The benefits are impressive indeed if you go beyond search and see this AI transformation as the biggest opportunity for disintermediation and reintermediation of entire value chains since the dawn of the (commercial) Internet. We will discuss this in a forthcoming blog post.

Posted in Uncategorized | Comments Off

EU accession through the back door

Posted on 2 June 2024 by dselz

A year ago, Credit Suisse went under. The rescue required 1/3 of Switzerland’s gross national product. CS was about a third as big as UBS. In other words, a new rescue, this time of UBS (and most G-SIBs have come knocking in the recent past), would require the entire Swiss gross national product.

Even our federal government and the SNB cannot afford that. Who can?

Chinese National Bank: Probably not
Japanese National Bank: We have other challenges
Fed: OK, Switzerland is actually a little closer to Washington than Hawaii, but becoming the 51st state of the USA?
That leaves the ECB. It could probably handle it. Yet we are not a member (yet). But the constraints described above will bring us closer to Europe…

Posted in Uncategorized | Comments Off

Unveiling the next level: RAG+++. Our take on making LLMs enterprise useful.

Posted on 29 May 2024 by dselz

RAG wasn’t a concept 9 months ago. Today everybody talks about how to ‘Chat with Data’, and to do that we and others have popularized the concept of Retrieval Augmented Generation, RAG for short.

The traditional RAG stack combines the strengths of information retrieval systems and generative models. In essence, it retrieves relevant documents or data points and uses this information to generate more accurate and contextually appropriate responses.

There are a few main drivers for combining LLMs with more traditional search techniques:

This setup will reduce hallucination
It’s a cost-effective alternative to ‘just’ using an LLM (GPU compute is more expensive than CPU compute)
LLMs are at their core not meant for large-scale search operations but for text comprehension and text generation
Domain-specific data: You interact with ‘just’ your (enterprise) data
Context-aware outputs: A properly set up search engine will be able to comprehend context and factor it into any answer
You can easily build industry- or use-case-specific accelerators
And a few more (link to our RAG papers)

Now for enterprise usage at scale a few elements are missing:

Enterprise security including entitlement control (who is allowed to see what), data lineage and governance (who does what with data)
An enterprise-ready setup that includes how to operate such a RAG stack over time, with often stringent requirements on how to develop, test, put into production and operate across different software environments, etc.
Any enterprise operates not just on textual data (what LLMs are primarily good at) but on real-time operational data from manifold systems.
And in an enterprise it’s not about ‘chat’ but ‘action’, as in reliably and consistently producing an output that is desired in the marketplace

These elements are all missing in a ‘standard’ RAG setup.

We are announcing today the next level: RAG+++. We add a few critical components to the RAG architecture:

Enterprise Security

We have been working hard to deliver a few key enterprise security requirements: entitlement handling with a full Access Control List (ACL) implementation, transparent data lineage, and a robust testing system. With this, Squirro enhances data protection and compliance and provides a traceable path of data flows and transformations, ensuring accountability and integrity.
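As a rough illustration of entitlement handling at retrieval time, the sketch below filters chunks by group membership before anything reaches the LLM. The `Chunk` structure, the group names, and the `visible_chunks` helper are hypothetical and are not Squirro’s implementation; they only show the principle that access rights travel with every chunk.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Chunk:
    """A piece of an indexed document together with its access control list."""
    doc_id: str
    text: str
    allowed_groups: frozenset

# Hypothetical index content.
INDEX = [
    Chunk("hr-001", "Salary bands for 2024.", frozenset({"hr"})),
    Chunk("pub-001", "Office opening hours.", frozenset({"hr", "sales", "engineering"})),
    Chunk("sales-007", "Pipeline forecast for Q3.", frozenset({"sales"})),
]

def visible_chunks(index, user_groups):
    """Keep only the chunks that share at least one group with the user."""
    groups = frozenset(user_groups)
    return [chunk for chunk in index if chunk.allowed_groups & groups]

# Only chunks that pass the filter may be ranked and passed on to the LLM.
for chunk in visible_chunks(INDEX, {"sales"}):
    print(chunk.doc_id, "-", chunk.text)
```

Filtering before ranking, rather than after generation, means that content a user is not entitled to see can never leak into an answer.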

Graphs: Taxonomies and Process Graphs

The first major enhancement in RAG+++ is the incorporation of graphs, specifically taxonomies and process graphs.

Taxonomies: These hierarchical structures classify information in a way that reflects the relationships between different data points. By integrating taxonomies, RAG+++ can better understand the context and nuances of the information it processes. This leads to more accurate and relevant data retrieval and generation.
Process Graphs: These illustrate the relationships and sequences between various processes or operations. In a business context, process graphs can model workflows, supply chains, or customer journeys. By incorporating process graphs, RAG+++ can provide insights and responses that consider the entire operational context, leading to more informed and strategic decision-making.
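One simple way a taxonomy can help retrieval is query expansion: a search for a broad concept also matches documents that only mention narrower ones. The sketch below assumes a hypothetical, hand-written taxonomy stored as a parent-to-children mapping; it is an illustration of the idea rather than how RAG+++ implements it.

```python
# Hypothetical taxonomy: each concept maps to its direct narrower concepts.
# It is assumed to be a strict hierarchy (no cycles).
TAXONOMY = {
    "financial instrument": ["bond", "equity"],
    "bond": ["government bond", "corporate bond"],
    "equity": ["common stock"],
}

def expand(term, taxonomy):
    """Return the term plus all narrower terms, walking the hierarchy depth-first."""
    expanded = [term]
    for child in taxonomy.get(term, []):
        expanded.extend(expand(child, taxonomy))
    return expanded

def matches(document, term, taxonomy):
    """True if the document mentions the term or any narrower concept."""
    text = document.lower()
    return any(candidate in text for candidate in expand(term, taxonomy))

doc = "The fund increased its holdings of common stock."
print(expand("bond", TAXONOMY))
print(matches(doc, "financial instrument", TAXONOMY))
```

The document never contains the words "financial instrument", yet it is found because the taxonomy knows that common stock is a kind of equity, which in turn is a financial instrument.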

Real-Time Operational Data Ingestion

Another significant advancement in RAG+++ is the ability to ingest and operate with real-time operational data sets. This capability ensures that the system is always up-to-date with the latest information, which is crucial for applications that rely on timely and accurate data. Whether it’s monitoring live sensor data in an industrial setting or tracking real-time market trends, RAG+++ can process and utilize this data to provide relevant insights and predictions.

Enhanced Security with Synthetic Data

Data security is a paramount concern in today’s digital age. RAG+++ addresses this by incorporating the use of synthetic data. Synthetic data is artificially generated data that mimics real data while preserving privacy and confidentiality. By using synthetic data, RAG+++ can train and operate AI models without exposing sensitive information. With this approach you can test an application in lower environments and/or sandboxes. You can also work with 3rd-party LLMs without worrying about exposing sensitive data. This approach not only enhances security but also supports compliance with data protection regulations.
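A minimal sketch of the idea: generate records that keep the shape of the real data (field names, value formats, plausible ranges) but contain no real values. The schema, the name lists, and the `synthetic_customer` helper are invented for illustration; production-grade synthetic data would also need to preserve statistical properties, which this toy example does not attempt.

```python
import random

# Invented placeholder names; nothing here is taken from real records.
FIRST_NAMES = ["Anna", "Luca", "Mia", "Noah"]
LAST_NAMES = ["Muster", "Beispiel", "Sample", "Example"]

def synthetic_customer(rng):
    """Create one artificial customer record with realistic-looking fields."""
    return {
        "name": f"{rng.choice(FIRST_NAMES)} {rng.choice(LAST_NAMES)}",
        # Account numbers keep the format "CH" + 10 digits but are random.
        "account": "CH" + "".join(str(rng.randint(0, 9)) for _ in range(10)),
        "balance": round(rng.uniform(0, 100_000), 2),
    }

def synthetic_dataset(n, seed=42):
    """Generate n records; a fixed seed keeps test environments reproducible."""
    rng = random.Random(seed)
    return [synthetic_customer(rng) for _ in range(n)]

for record in synthetic_dataset(3):
    print(record)
```

Because the records only mimic the format of real data, such a set can be loaded into a sandbox or sent through a third-party model without any real customer being involved.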

Guardrails for Prompts, Brand, Regulatory, and Tone of Voice Compliance

Maintaining consistency in brand messaging, adhering to regulatory requirements, and preserving the intended tone of voice are critical for any organization. RAG+++ introduces robust guardrails to ensure these aspects are not compromised. These guardrails function as predefined rules and checks that the system adheres to during data processing and response generation. They ensure that:

Better prompting: Most of us are at best okay with formulating complex prompts even though they are required for good answers. The system provides in the background extended prompts for better results.
Brand Consistency: The generated content aligns with the organization’s branding guidelines, including visual style, messaging, and overall identity.
Regulatory Compliance: The system operates within the legal and regulatory frameworks applicable to the industry, avoiding potential legal pitfalls.
Tone of Voice: The responses maintain the intended tone, whether it’s formal, friendly, authoritative, or casual, ensuring consistent communication with the audience.
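Guardrails of this kind can be sketched as two small functions: one extends the user’s prompt in the background, the other checks a generated answer against predefined rules. The prompt template, the banned phrases, and the citation rule below are hypothetical examples, not the actual RAG+++ rules.

```python
import re

# Hypothetical rules an organization might configure.
SYSTEM_RULES = (
    "Answer in a formal tone. Cite the source document id in square brackets. "
    "If the context does not contain the answer, say so."
)
BANNED_PHRASES = ["guaranteed return", "risk-free"]

def extend_prompt(user_question, context):
    """Wrap the short user question in a fuller prompt with rules and context."""
    return f"{SYSTEM_RULES}\n\nContext:\n{context}\n\nQuestion: {user_question}"

def validate_answer(answer):
    """Return a list of rule violations; an empty list means the answer passes."""
    problems = []
    lowered = answer.lower()
    for phrase in BANNED_PHRASES:
        if phrase in lowered:
            problems.append(f"banned phrase: {phrase}")
    # Require at least one citation such as [doc-12].
    if not re.search(r"\[[\w-]+\]", answer):
        problems.append("missing source citation")
    return problems

print(validate_answer("This product offers a guaranteed return."))
print(validate_answer("Fees are listed in the price sheet [doc-12]."))
```

An answer that fails validation could be regenerated, corrected, or withheld; which of these is right depends on the use case and is left open here.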

Agents to autonomize workflows

Agents complement the RAG structure by enhancing its capability to handle diverse and complex tasks. While RAG combines information retrieval and generation to produce accurate and relevant responses, agents can dynamically manage, orchestrate, and optimize these processes. This synergy improves the efficiency, accuracy, and adaptability of the RAG system in various applications.

The introduction of RAG+++ holds transformative potential across various industries:

Finance: Ingesting real-time market data, understanding financial taxonomies, and ensuring compliance with regulatory frameworks can lead to better investment strategies and risk management.
Healthcare: Real-time patient data, taxonomies of medical conditions, and process graphs of treatment protocols can enhance patient care and operational efficiency.
Manufacturing: Monitoring real-time production data, mapping out process graphs of manufacturing workflows, and maintaining data security can optimize operations and improve product quality.
And similar impact in any other industry.

For a sneak peek, watch these videos:

https://go.squirro.com/Agents

https://go.squirro.com/Agents_AM

Conclusion

RAG+++ represents a significant leap forward. By integrating graphs, real-time data ingestion, synthetic data for security, stringent guardrails, and agents, RAG+++ offers a comprehensive solution that addresses the multifaceted challenges of any enterprise. Happy to chat about how we can support your business.

PS:

Initially (24 months ago) we called it raLLM – retrieval augmented LLMs. RAG came around and was adopted as the moniker for this combination of retrieval and LLMs. Here’s the challenge to the community: RAG+++ is our moniker. Any better ideas?

Posted in Uncategorized | Comments Off

AI will displace jobs. And create more new ones.

Posted on 8 May 2024 by dselz

Yes, AI will displace jobs. And it will provide opportunities for growth. Here’s a smart way to think about it.

25 years ago, when I had notably fewer gray hairs, I was part of a team that launched one of the pioneering e-commerce shops here in Switzerland for Otto Fischer, a distributor of electrical materials. This shift marked a significant change from the existing order-by-fax system that dominated the 1990s.

Back then, electricians had to submit their orders by 12 noon via fax to receive their materials by 7h00 the next morning. This process was not only time-consuming but fraught with potential errors due to the manual entry of faxed orders into the ERP system and difficulties in deciphering handwritten numbers—was that a smudged ‘3’ or an ‘8’?

Imagine an electrician who has spent the morning drilling into concrete walls writing a fax order. The strain of drilling affects their handwriting, making the faxes difficult to read. Additionally, the technology of fax machines at the time was far from perfect, often adding another layer of confusion with noise and distortions in the transmission.

An entire floor of staff was dedicated to manually processing these orders, which involved verifying ambiguous entries by calling the wife of the electrician. These were the days when mobile phones were not widely available. Edith would call Mary to find out what construction site John was on to figure out whether it was a 3 or an 8. Inevitably the calls, quite a permanent feature, were used for updates on personal and family life, too. This personal touch built strong customer relationships.

The Digital Revolution: Introduction of the Webshop

With the introduction of the webshop, the ordering deadline was extended to 19h00, still ensuring a 7h00 delivery. This was a game-changer in terms of efficiency and convenience. However, it brought fear to the logistics team, who saw their roles—centered around manual order processing and customer verification—as becoming obsolete.

And we young guns nearly missed the point. Sure, this is a family business, so people are not let go. Yet manual order processing was no longer required. What to do? Then, and we nearly missed it, we got a crucial insight: thanks to their constant interactions over order confirmations, the logistics team simply had the best understanding of the product catalogue on offer and of how the products mapped to construction requirements.

We shifted their role from simply confirming details to proactively advising on and upselling newer or improved products. Edith would still call Mary. Now she did not need to confirm 3 versus 8 but could talk about enhanced products and what solutions better met construction requirements. And sure, they still talked family.

In a real estate and construction market that was depressed at the time, the company significantly boosted its sales. The secret to its success: the logistics team building a loyal customer base.

Lessons for Today’s AI Transformation

This transformation serves as a potent illustration for businesses today, especially in the context of the GPT revolution and other AI advancements. Yes, the AI revolution will displace jobs. And yes, the simplest path for companies is to use technology strictly as a cost-cutting tool. However, the smarter approach is to leverage technology to enhance human capabilities, not replace them.

Focus your colleagues on what humans do best: relationship building and decision making in grey-area situations. Let computers do what they do best: fast computation. This strategy creates opportunities for growth and innovation in a changing technological landscape by combining human insight with digital efficiency.

I bet the most successful companies will be those that view innovations as tools to enhance, rather than replace, human capabilities.

Posted in Uncategorized | Comments Off
