A common question that comes up when discussing ChatGPT in the context of knowledge, document and information management, is something along the lines of: "When or how will ChatGPT be able to discover our enterprise content, such as SharePoint Pages or Documents stored in Teams and SharePoint?"
In this blog:
Discovering enterprise content in Microsoft 365 with ChatGPT
Despite a reputation of having been proven wrong a few times when saying "it will never happen," I'm going to stick my neck out and say: "ChatGPT is not going to 'discover enterprise content' ... ever." I obviously have lots of caveats tied to this statement. The most important one being that I am referring to "discovering".
Without being an AI expert or a ChatGPT expert, I believe it's fairly clear ChatGPT is not designed to carry out scaled discovery of content within enterprise boundaries (i.e. within your Microsoft 365 SharePoint/Teams content). If my understanding is correct, it is in fact not designed to discover content at all.
As per Wikipedia's page on GPT-3: "Generative Pre-trained Transformer 3 (GPT-3) is an autoregressive language model released in 2020 that uses deep learning to produce human-like text. Given an initial text as prompt, it will produce text that continues the prompt."
In other words, ChatGPT doesn't look for or give you answers, unless you provide it with a prompt to do so.
However, this does not mean that it cannot work with you content. On the contrary - it will work with (and respond to) the content that you feed it in the prompt.
Example: Microsoft Teams Premium with ChatGPT
You may have come across – or you might be using – Microsoft Teams Premium, which is integrated with ChatGPT. It's tempting to argue that ChatGPT automatically gives me outputs from my Teams Meetings, the content of which could be considered enterprise content.
What is actually happening here is that the transcribed text from a Teams meeting is fed to ChatGPT (i.e. the transcription is part of the prompt sent to ChatGPT) and in return GPT-3.5 generates suggestions for tasks and actions that have come out of the meeting. If you consider the spoken word (which is transcribed to text) as enterprise content, fair enough. Either way, this is only happening because the Teams Meeting was enabled (see "Microsoft 365 Copilot" below) to send over the transcribed text for that meeting.
So in practical terms, imagine a scenario where during a meeting, Sally says "Bob, it would be great if you could dig out the quarterly sales numbers for Q4 last year, so we can compare them in our next meeting."
What we should not expect from the GPT service alone is that it performs some sort of an internal search, and digs it out for us (or Bob). In other words, we will not see something like this from GPT without us feeding the information to GPT:
"Bob will find the sales number for Q4 last year (suggested file here: 2022-Q4-Sales-Results.pdf)."
What we can expect is that GPT auto-generates a Suggested Task as follows:
"Bob will find the sales number for Q4 last year."
Below are some examples from Microsoft's blog announcing Teams Premium:
Does ChatGPT not look at any file content in Teams or SharePoint? Or SharePoint pages?
No, it doesn't do that directly or proactively - it requires something or someone to feed the content to it. (See seperate blog about Microsoft Copilot published March 2023)
In other words, in order for ChatCPT to 'look at' internal enterprise content such as pages or documents, the user (or some automation acting on behalf of the user) will have to grab the textual content, and send it in a prompt to ChatGPT with an instruction of what to do with it.
So, if I want ChatGPT to summarise a document, I would have to copy the document text, then move over to ChatGPT and enter something like: "Summarise this: [past my document content here]"
A pseudo prompt could look something like this:
“Summarise the following document in maximum 3 paragraphs, and provide a list of document chapters, each with a maximum 50 word summary of each chapter, followed by up to 10 suggested keywords for the document: [DOCUMENT BODY]”
Of course, this can be done programmatically through APIs, and this is where Microsoft 365 Copilot comes into the picture (announced on March 16 2023) which provides premium capabilities to automate this capability across Microsoft Office, Outlook at Teams (not just Teams Meetings). For instance, the automatic creation of a proposal, based on existing documents and emails, or in other areas of content service automation.
Microsoft 365 Copilot
Microsoft's solution that will allow ChatGPT (or other Large Language Models) to do something with your own content is Microsoft 365 Copilot. Announced by Microsoft on March 16 2023, this capability (which is in its early stages) will help business users combine their prompts (from within a Microsoft app such as Word) with content in their Microsoft 365 tenant (using Microsoft Graph). This combined prompt is then sent to the LLM (Large Language Model, such as ChatGPT), resulting in a user and business relevant response. The key thing here to note is that ChatGPT did not itself go and grab or discover any content. You can read in this blog introducing Microsoft 365 Copilot and Business Chat.
Examples of summarising content - internally and externally
So to put this into perspective, ChatGPT does a good job of summarising a piece of content when it is available in the public domain. But without something like Microsoft 365 Copilot, it would not know what to do with any of your internal content.
Example: Prompting for a summary of content available on the internet
Here's a real example, using a publicly available article authored by Chris Marino, from Enterprise Knowledge (linked here).
My prompt to ChatGPT was:
"Summarise in maximum 200 words an article called 'Is ChatGPT Ready for the Enterprise?' by 'Chris Marino' and add a list of headlines with a one line recap for each section."
Not a bad summary at all, with 6 headlines summarised – although the actual headlines are not correct. ChatGPT has summarised its own interpretation of the key themes as headlines, but it did – as requested – provide a one-line summary for each of these.
So while this is impressive, the limitation is crucial. I had to prompt it (using either the actual text of a document, a link to a publicly available article, or in this case a clear instruction on how to find that publicly available article), which allowed the pre-trained GPT service to perform its magic.
Example: Prompting for a summary of content available on our internal systems
So let me now compare this, by creating a similar prompt, where I want ChatGPT to summarise internal content that I know exists in our own Atlas Digital Workplace for ClearPeople. The result from ChatGPT is now very different when asking as follows:
"Summarise in maximum 200 words an internal article called 'Viva Topics' by 'Gabriel Karawani' and add a list of headlines with a one line recap for each section."
The above is of course purely using the public ChatGPT service, without the assistance of Microsoft 365 Copilot or any other integration with internal content.
What are good business uses for ChatGPT and should its use be managed?
What is certain is that any enterprise ChatGPT capability will come at a premium (such as Teams Premium) as it requires some not-insignificant computing resources to run, plus any investments you might consider for bespoke/custom models (aka 'fine-tuning').
Running such a service against all your enterprise content would therefore come at a high price (and consume a lot of energy). Apart from the cost question, you should also ask 'why?'.
And here are some other questions organisations will have to ask themselves:
- Why would you use a GPT AI for internal content?
- What are the key scenarios that would benefit from a generative AI text transformation service?
- What is the ROI in comparison to doing nothing, or compared to other capabilities/features you can employ more efficiently?
- What governance might you want to consider?
- What tools will you invest in and make available to your end-users? (see article about Shadow AI here)
- How will you construct prompts to provide the types of responses you expect (targeting the right audience, removing bias, etc)?
- What are the regulatory / compliance considerations (e.g. should you feed PID to ChatGPT)?
- How will you deal with quality assurance of responses (the service is generic and far from error-free, so how will you validate and approve responses for further use and action)?
- Should you prevent any automated decision-making based on ChatGPT responses?
- How will you deal with permission management (who can run this and what content can be fed to it)? If you create an integration, what will the service have access to?
- If you are training custom GPT models with internal content, should you prevent any sensitive information from being used?
- How will you budget for and control the running costs?
- What do you do / what is the intent with the response from the AI service?
- For instance, in the example above, how might you leverage the summary and the suggested keywords?
- How might you share the outcomes, to prevent the same or similar queries being run time and time again, increasing your costs?
It's early days for GPT – but it is exciting nonetheless when considering its possible use cases. Internally, custom-trained (fine-tuned) GPT models with internally relevant datasets are likely to be a must, to provide sensible outcomes for your users.
A year from now, I am sure we will see it in action for lots of creative and innovative use cases that we have not even thought of yet. Have a look at the Forbes article with a list of job areas that are likely to feel an impact (for better or worse).
My prediction, however, is that enterprise content discovery is not one of those we will see successfully implemented.
What does ChatGPT have to say about this?
Before publishing this, I thought I better check if ChatGPT agrees with the general conclusion:
More blogs on artificial intelligence:
References:
|
Real scenarios for using Microsoft 365 Content Services and SharePoint Syntex
Watch this webinar where Microsoft, ClearPeople and their customer, Northumbria Water Group, discuss real and practical scenarios for how NWG uses Syntex and what they're achieving with it.
> Watch webinar
|