This company wishes to improve collaboration across its departments and reduce onboarding attrition.

This organization is distributed across an entire continent, and connectivity can be poor in some locations.

The organization’s public websites store thousands of PDFs (project TARs and RRPs, briefs, papers, …), blog posts, author biographies, and more.

We have converted unstructured and semi-structured data into structured data, extracting publication dates, author names, countries, and texts. We have then mapped the recognized author names to a primary source of truth.
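
As an illustration of the author-mapping step, here is a minimal sketch of how a recognized name could be matched against a canonical author list. The function names (`normalize`, `map_author`), the sample names, and the similarity cutoff are hypothetical, and fuzzy matching with Python's standard `difflib` is an assumption made for this example, not a description of the matching method actually used in the project.

```python
import difflib
import unicodedata


def normalize(name: str) -> str:
    """Lowercase, strip accents, and collapse whitespace so variants compare equal."""
    decomposed = unicodedata.normalize("NFKD", name)
    without_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(without_accents.lower().split())


def map_author(recognized: str, canonical_authors: list[str], cutoff: float = 0.85):
    """Return the canonical author closest to `recognized`, or None if none is close enough.

    `canonical_authors` stands in for the primary source of truth (hypothetical data).
    """
    index = {normalize(author): author for author in canonical_authors}
    key = normalize(recognized)
    if key in index:  # exact match after normalization
        return index[key]
    # Fall back to fuzzy matching; the cutoff is an illustrative assumption.
    matches = difflib.get_close_matches(key, list(index), n=1, cutoff=cutoff)
    return index[matches[0]] if matches else None


# Hypothetical sample data, not taken from the organization's records.
canonical = ["María Fernández", "John A. Smith", "Wei Zhang"]
print(map_author("maria  fernandez", canonical))  # accent and spacing variant
print(map_author("Jon A. Smith", canonical))      # small typo
print(map_author("Unknown Person", canonical))    # no plausible match
```

Returning `None` for names without a close match keeps uncertain cases visible for manual review instead of silently attaching them to the wrong author.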

Challenges: