Articles

Data Frame

July 28, 2020

A quick tip on data visualization in Pandas: to display your dataframe quickly, call its plot() method (pd.DataFrame.plot()). By default, it draws your data as a line chart; if you need another type of chart, pass it as the kind argument within the parentheses, for example kind="bar" or kind="hist". Use this command for when […]
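As a rough illustration of the tip above, here is a minimal sketch assuming pandas and matplotlib are installed. The sample data and column names ("sales", "returns") are made up for the example and do not come from the article.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the script also runs without a display

import pandas as pd

# Hypothetical sample data; the column names are illustrative only.
df = pd.DataFrame(
    {"sales": [10, 15, 12], "returns": [1, 3, 2]},
    index=["Jan", "Feb", "Mar"],
)

ax_line = df.plot()              # default: a line chart, one line per column
ax_bar = df.plot(kind="bar")     # grouped bars, one group per row
ax_hist = df.plot(kind="hist")   # histogram of the values in each column
```

Each call returns a matplotlib Axes object, so titles and labels can be adjusted afterwards. In a Jupyter notebook the charts typically render inline, so the backend line at the top is not needed there.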

Jupyter Notebook

July 21, 2020

A common IDE many data scientists find themselves using is Jupyter Notebook. We’re not saying everyone should switch to Jupyter Notebook, but consider giving it a shot. Its features make it easy for developers to display and explain their code. The option to change a cell to utilize Markdown syntax and […]

SAME Facilities Management Workshop

July 20, 2020

Data Products Principal Data Strategist Dr. Mechie Nkengla will present at the SAME Facilities Management Workshop on July 29th, 12:30-1:30pm on the topic of “Artificial Intelligence and Improving the IQ of ‘Smart’ Buildings.” We are looking forward to furthering the conversations around enhanced building security, more efficient use of resources, and improving communities.

Precision and Recall

July 13, 2020

When evaluating machine learning models, accuracy is usually the go-to metric. However, it is not always the appropriate metric, especially for imbalanced datasets. Use precision when you need to reduce the number of false positives, and use recall when you need to reduce the number of false negatives. To figure out your scores for […]
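To make the two metrics concrete, the sketch below computes them by hand from their standard definitions: precision = TP / (TP + FP) and recall = TP / (TP + FN). The function name and the labels are hypothetical, and this is not necessarily how the article computes its scores.

```python
def precision_recall(y_true, y_pred, positive=1):
    """Return (precision, recall) for binary labels, counted by hand."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical labels: 3 true positives, 2 false positives, 1 false negative.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 1, 1, 0, 0, 1, 1, 0]

precision, recall = precision_recall(y_true, y_pred)
print(f"precision={precision:.2f} recall={recall:.2f}")  # precision=0.60 recall=0.75
```

Here the two false positives pull precision down to 0.60, while the single false negative leaves recall at 0.75, which shows how each metric tracks a different kind of error.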

Pandas

July 7, 2020

Still working on datasets in Excel? Consider giving Python a chance. Many data scientists have moved on to Python to manipulate, engineer, and analyze large datasets. Use the Python library Pandas to retain that familiar Excel format but with all the Python tricks. Pandas has been the go-to library for many data […]

Data Profiling for Business Analytics

June 30, 2020

Quite simply, data profiling is an analysis of the content of a data source. This process allows you to assess the quality of your data, which may ultimately affect your business analytics. There are three main types of profiling to keep in mind: 1) Content discovery. This involves searching for errors by reviewing […]