Azure AI for M365 Information Governance and Protection

Microsoft Azure offers Cognitive Services that can be used for auto-classification of files and images. These include the following features. Natural language processing skills include entity recognition, language detection, key phrase extraction, text manipulation, sentiment detection, and PII detection. Image processing skills include Optical Character Recognition (OCR) and identification of visual features, such as […]
M365 governance as foundation for innovation

Information is the new oil – it can be used to add value, reduce costs, manage risks, and disrupt competition. But for many organizations, a lot of their corporate information is out of reach and difficult to find. Cloud platforms as a game changer: With a hosted architecture, companies often had to buy a […]
Microsoft Podcast: How to start your information governance and records management journey for M365

In this episode, Microsoft speaks with Vivek Bhatt, Chief Technology Officer at Infotechtion, about charting the direction of your information protection and records management journey. Listen in as Vivek and Bhavy discuss the importance of identifying your company’s key success factors, developing a roadmap that quickly provides value to your business, and the lessons learned […]
Webinar: Automate M365 governance with new Microsoft machine learning and AI

Webinar: Wednesday, February 24th, 2021 at 9am PT / 12pm ET / 18:00 CET. Microsoft has introduced new AI and machine learning that can automate how important data and content are identified and classified. This will improve search and discovery, and also help ensure compliance with business standards and regulations, e.g. GDPR. Attend this webinar to […]
Document Autoclassification Strategies

Once again, keep in mind that the exact same piece of content can be classified in different ways depending on its function, not its content. An invoice is an invoice, unless it is training material for an invoicing system, unless it is evidence of fraud or litigation about payments, unless it is sample data, unless […]
Document Autoclassification using Content: Similarity and Topic Comparisons

Similarity and Topics: Similarity classifies content by determining how close one document is to another. This is one area where AI is being leveraged. There are a number of variants of this capability, but generally, if you know that one thing is a true representation of what you are looking for (an “exemplar”), other things that […]
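As a rough illustration of the exemplar idea in the excerpt above, the sketch below scores candidate documents against one known-good document using cosine similarity over plain word counts. This is only one simple variant and an assumption on our part; it is not the AI-based approach the post refers to, and the function names (`term_counts`, `cosine_similarity`, `rank_by_similarity`) are hypothetical.

```python
import math
import re
from collections import Counter


def term_counts(text):
    """Lowercase the text and count its alphanumeric word tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two term-count vectors (0.0 to 1.0)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is similar to nothing
    return dot / (norm_a * norm_b)


def rank_by_similarity(exemplar, candidates):
    """Return (score, name) pairs, most similar to the exemplar first."""
    exemplar_counts = term_counts(exemplar)
    scored = [
        (cosine_similarity(exemplar_counts, term_counts(text)), name)
        for name, text in candidates.items()
    ]
    return sorted(scored, reverse=True)


if __name__ == "__main__":
    # Hypothetical sample texts, invented purely for illustration.
    exemplar = "Invoice number 1001. Amount due: 250 USD. Payment terms: 30 days."
    candidates = {
        "doc_a": "Invoice number 1002. Amount due: 90 USD. Payment terms: 14 days.",
        "doc_b": "Meeting notes from the quarterly planning session.",
    }
    for score, name in rank_by_similarity(exemplar, candidates):
        print(f"{name}: {score:.2f}")
```

In practice a threshold on the score would decide whether a candidate receives the exemplar's classification; where that threshold sits is a tuning decision this sketch does not address.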
Document Autoclassification using Content: Keywords, Number and Word Patterns

Keywords: A word (or set of words) can be associated with a type, metadata or security. Usually, using a keyword to find a type of content (like using the word “contract” to find a contract) is problematic for a number of reasons and is not very useful. You will find contracts, but you will also find […]
Document Autoclassification using Context: File Extensions, Metadata and Properties

File extensions: This is the easiest entry point into classifying content because it can be done with a good >DIR command and a spreadsheet. It is done without needing to access and open the content – which makes it fast. It can also be done very efficiently from the cloud. The file extension mostly tells […]
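The directory-listing approach described above can be sketched in a few lines of Python. This is a minimal illustration, assuming a locally accessible folder tree; the helper name `count_extensions` is hypothetical, and like the DIR-plus-spreadsheet method it reads only file names and never opens the content.

```python
import os
import tempfile
from collections import Counter
from pathlib import Path


def count_extensions(root):
    """Tally file extensions under root using only directory listings."""
    counts = Counter()
    for _dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            # Path.suffix is "" when a name has no extension.
            ext = Path(name).suffix.lower() or "(none)"
            counts[ext] += 1
    return counts


if __name__ == "__main__":
    # Build a throwaway folder with hypothetical file names to demonstrate.
    with tempfile.TemporaryDirectory() as root:
        for name in ["a.docx", "b.DOCX", "c.pdf", "README"]:
            Path(root, name).touch()
        for ext, n in count_extensions(root).most_common():
            print(ext, n)
```

The resulting tally plays the role of the spreadsheet: it shows which formats dominate a repository before any content-based classification is attempted.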
Document Autoclassification: Source and Purpose

Before we start looking at classification techniques, a few concepts need to be defined. Source data: The information we rely on to classify content automatically comes from three sources within a single document. Format – Format often includes the coding that allows a specific application to work with it. It can also include any structure […]
Document Autoclassification Technologies

A MasterClass – ification for unstructured content. Post 1: This is an exciting time to be in the world of AI, machine learning, and autoclassification for unstructured information. This is a broad topic, so I want to focus on very specific elements that you can search on to auto-apply classifications and where they work well. […]