John Kerski in Data Science Comparing Groups for Similarities in Power Query – Using Cosine Similarity I’ll admit upfront—I am not a data scientist by trade. Instead, I’ve picked up my data science skills over time,... 17 December 2024 7 min read
Editorials Kathi Kellenberger in Editorials Machine learning, both exciting and scary The 2002 movie Minority Report is about a police unit called PreCrime, which can predict when people will commit a... 25 August 2021 2 min read
Data Science Shree Das in Data Science Kubeflow for data scientists introduction Machine learning projects often stall when it's time to deploy. Shree Das introduces Kubeflow for data scientists, an end-to-end solution... 08 January 2021 7 min read
Buck Woody Data Science Laboratory System – Distributed File Databases Distributed File Databases manage large amounts of unstructured or semi-structured data. They are designed on the principle of splitting up... 21 March 2014 15 min read
Buck Woody Data Science Laboratory System – Key/Value Pair Systems Though the Key/Value pair paradigm is common to almost every computer language, there is no clear agreement yet for the... 17 July 2013 17 min read
Buck Woody Data Science Laboratory System – Relational Database Management Systems There is no better way of understanding new data processing, retrieval, analysis or visualising techniques than actually trying things out... 14 June 2013 19 min read
Buck Woody Data Science Laboratory System – Programming and Scripting Languages Although every computer language is suitable for data, some languages lend themselves especially well for working with certain types or... 16 April 2013 18 min read
Buck Woody Data Science Laboratory System – Interactive Data Tools Data tools interact directly with data and are great for automating data data-aquisition, but they aren't always the best way... 18 March 2013 15 min read
Buck Woody Data Science Laboratory System – Instrumentation It is sensible to check the performance of different solutions to data analysis in 'lab' conditions. Measurement by instrumentation makes... 18 February 2013 18 min read
Buck Woody Data Science Laboratory System – Testing the Text Tools and Sample Data Anyone who is frequently faced with preparing data for processing needs to be familiar with some industry-standard text-manipulation tools. Awk,... 15 January 2013 18 min read