Parquet data format is quantum format. You never know what is inside because whatever tool you use for reading already interprets the data :-D The problem we are dealing with right now is the problem of timestamps and timezones in the data :-)
Jan Sobotka’s Post
More Relevant Posts
-
MPhil Data Intensive Science at Cambridge University | Two-Time Author | First Class Mathematics Graduate | Founder of Zinco
I implemented the K nearest neighbours algorithm on the iris data set in R. Firstly I cleaned the data and ordered it into a format that the K nearest neighbours algorithm can deal with. I then visualised the data and saw that the data was separated into two clusters based on the type of flower it was (versicolor or setosa). I then chose to test the point (6, 7) with K = 3 to determine which flower it would classify as. After implementing the algorithm it classified as versicolor. Here is the code: https://lnkd.in/eSYhsPWc
To view or add a comment, sign in
-
Day 2 of GFG POTD Question: - Largest Number formed from an Array Topic: -Arrays, Data Structures
To view or add a comment, sign in
-
By examining both scatter plots side by side, one can gain a more comprehensive understanding of the data's structure and relationships. The original data plot showcases the true variability and distribution, while the min-max scaled data plot highlights relative positions and patterns, making it easier to interpret relationships and compare different datasets on a standardized scale.
To view or add a comment, sign in
-
📌 Merkle Trees These are cryptographic data-structure using hashes, that helps verify data integrity without needing to recheck the entire dataset 📌 Merkle Proofs are way of proving that some data is there in the tree
To view or add a comment, sign in
-
Check out our latest Llama 405B video where you can explore the use of LLMs for fine-tuning and synthetic data generation: https://bit.ly/3AdAano
To view or add a comment, sign in
-
Binary data in matrices with singular value decomposition method https://lnkd.in/deFzpwDC
To view or add a comment, sign in
-
Clinical SAS Programmer | Expertise in Neuroscience Studies | Skilled in Data Analysis & Statistical Programming (SAS, SQL, Macros)
this is dataset as give below output DATA G; X="AB12CD34"; run;
To view or add a comment, sign in
-
Data-Driven Compliance | Financial Crime, Sanctions, Cybersecurity, Privacy | Data Science, Multi-agent AI
Considerations for k-shots: 1. Complexity of the tasks; 2. Quality of the data quality; and 3. Algorithm used. Experiment to balance performance and efficiency.
To view or add a comment, sign in