I manually determined there was an individual posing as two people (playing both the antagonist and the adversary) because they consistently misspelt certain words such as "definitely" as "defiantly".
Fingerprinting certain linguistic traits and mapping that to time-zones as well as confirming there is a partial overlap in posts but never exact worked exceedingly well. Someone can't easily maintain a fluent conversation between themself on two accounts, but they can either get close, either through unnatural delays between sentences or just never interacting with the "other" party at the same time.
Simplicity is the greatest form of sophistication! Great work!
One small nit from a user experience point of view..: it'd be easier on the eyes if you just truncated those cosine similarity scores (or whatever score you're using) after the, say, 5th digit. Showing the entire float is kinda messy to my eyes.
Don’t sell yourself short. Simplicity is smart. It’s astonishing how often the simplest thing turns out to be exponentially more effective than the so-called smart thing.
I can’t get over how phenomenal this is. Please put every one of your side project ideas into production!
cool and thanks for the clarification. i ask that mainly because of the request limit of openai, which is something that makes many scalable ideas unfeasible
In a way we are so lucky that we have recording technologies now for both video and audio and literally 100's of hours of people like him to enjoy. Imagine what it would have been like to visit a concert where JS Bach was playing. Back then no amplification and really only the lucky few that got to see this up close. The very best your average person could do would be to go to church for some half decent music.
yes, I completely understand your point. i'm a huge fan of Erik Satie for example and I'm lucky enough to be able to play 4 of his songs on the piano. I'd love to - at least - hear him playing his own compositions, but ofc it's impossible lol.
Oh thats dope, will read it! and I'm pretty sure I read your blog once, because of the pianojacq! obs: i'm planning a visit to Amsterdam this year because a conference of recommender systems, so will for sure visit the museum if I'm able to go there. thanks!
yep, i suddenly felt in love with music after my dad passed away - was in college studying materials engineering, and made a living out of music for 6 years (until the pandemics)
where I work for, our stack is all about GCP/Airflow/Python/BigQuery ML, for recommender systems. I'm now playing around with Turicreate (Apple) to compare with BQML.
I think you are missing one of the two pieces the question builds on: Linear Algebra (do you know how a dot product is defined?) or Datastructures including their complexity. You would prepare by reading up on whichever of these two you are missing.
If you know how it's supposed to work in theory but struggle to bang it out, you could do eg leetcode.
"However, for a data science beginner, SQL is the best place to start."
I totally agree with that statement. Being a beginner myself in the DS field, I'm living through this right now in my job. And, as a plus, working with SQL everyday is also helping me a lot to have different perspectives in handling the Python/Pandas DataFrame.