Informer: A bot library that allows masquerading as real users on Telegram

jhabdas · on Dec 10, 2019

This seems novel until one realizes public channels are already available on the web.[1] Also, I'm surprised Paul didn't go with Pyrogram for creating the user accounts (which have unlimited cloud storage and 1.5GB file limits btw).

[1] https://telegram.org/blog/privacy-discussions-web-bots#view-... [2] https://github.com/pyrogram/pyrogram

ksangeelee · on Dec 10, 2019

The words 'mass surveillance' in the project title seem to me sensationalist. From his repository list, I suppose the author is involved in ads and marketing, which would figure.

Had I parsed Usenet feeds into a relational database, and called it 'mass surveillance', I'd have rightly been ridiculed.

Regardless, it can be useful to have message data in this format.

riter · on Dec 10, 2019

I simply was not aware of Pyrogram, thank you for sharing that and pointing it out.

And yes, I admit the headline is a bit charged and its implementation does not reflect it as much as I would like.

It does not change the fact that I researched an implementation of Telethon for a use case I needed, and I didn't find it so I made it and shared it.

My involvement in ads and marketing had nothing to do with that. It's less nefarious, it just sounded cooler and was an innocent mistake in retrospect.

As mentioned above, I appreciate the mod(s) for taking the time and energy to edit the title so it reflected the implementation. This is my first shared project on HN and I am excited to finally participate and see the feedback/energy.

Thank you guys/gals!

19h · on Dec 10, 2019

Sounds like a good old IRC channel logger.. but for Telegram. The title is some serious click bait.

That said, I haven’t checked deeper, if this bot is not actually using the bot API but MTProto, this is pretty significant as the bot appears like a normal user (and not as a bot, which are required to have a “-bot” suffix on Telegram).

Robadob · on Dec 10, 2019

It states that it's using telethon.

That appears to be an MTProto library.

https://docs.telethon.dev/en/latest/concepts/botapi-vs-mtpro...

riter · on Dec 10, 2019

Yes, it does use MTProto and yes it is a real user. And yes admittedly the title in retrospect was a bit charged but the applications remain the same if extended.

wtmt · on Dec 10, 2019

What prevents anyone from doing the same with Signal or any messaging service that allows one to build a client? This program isn’t pretending to be a bot on Telegram, and it works as a normal user in every way (including the requirement for a working phone number, even if it’s a burner number).

leppr · on Dec 10, 2019

You can't ever assume that clients aren't logging everything that goes through them, even if there's no official documentation/API for custom clients. If a human can read a message, for all intent and purpose assume that a machine can too. For instance, things like Snapchat self destructing messages rely more on social norm than technology.

batushka3 · on Dec 10, 2019

Snapchat messages do not self destruct. They are just hidden. Multiple times during some glitch I saw old messages from weeks ago.

vips7L · on Dec 10, 2019

In this day and age who would actually believe anything in snapchat goes away after being viewed?

dorchadas · on Dec 10, 2019

As a teacher, I can most certainly tell you teenagers believe it does go away. Which is a problem.

m12k · on Dec 10, 2019

Well, Signal comes with a system for verifying a person's identity, so you can be sure it's really someone you know and not an imposter. But sure, for semi-public channels that will let anyone in without verification, something like this would allow you to monitor it. Lesson: If you're using Signal to run a dissident network and organize protests, be sure to verify everyone before adding them to groups.

kome · on Dec 10, 2019

> Well, Signal comes with a system for verifying a person's identity

like what? I really have no idea.

m12k · on Dec 10, 2019

There's an article about how it works in their help section: https://support.signal.org/hc/en-us/articles/360007060632-Wh...

AmericanChopper · on Dec 10, 2019

This doesn’t verify an identity, it verifies that somebody has access to a key that at some point in the past, you chose to trust. Signal has no mechanism at all for verifying identity, verifying the authenticity of the safety number is entirely up to you.

ganstyles · on Dec 10, 2019

Maybe I'm missing the point, and if so please correct me, but when I add new people to signal we do so in person and there's a QR code where one can "verify" the person. It obviously links the verified user with their phone, so theoretically someone could steal their phone and pretend to be them I guess. But it is making them a "verified" user, it's just up to me to verify.

AmericanChopper · on Dec 10, 2019

The safety number is essentially equivalent to a self-signed certificate. You can use it to consistently identify a key holder, but it doesn’t offer you any way to identify who that key holder is. If you want to trust a self-signed cert, then figuring up whether you should or not is entirely up to you. This is the problem that CAs address with CA signed X.509 certificates.

If you want to validate identity as a service, then the only options you really have available are a central authority, or a web of trust. Both of which have serious downsides, and neither of which are offered by Signal.

iudqnolq · on Dec 10, 2019

That's a generally correct statement, but the comment you replied to mentioned a specific case for which it's false.

If you can reduce the trust problem by requiring every person to verify out-of-band the identity of every one of their contacts, it becomes a lot simpler. For some Signal users, such as the person you replied to, this appear to be the case.

AmericanChopper · on Dec 10, 2019

Marking a safety code as trusted in signal is not different in any way to adding a self-signed TLS certificate to the trust store on your computer. You can perform exactly the same out of band verification of the self-signed cert. Signal does not have a single feature that verifies the identity of your contacts, they don’t pretend to have such a feature, and frankly I’d say it’s dangerous to assert that they do. How you do that is 100% your own problem to solve. The only thing it does is provide you with a trust store to use to keep track of which contacts you have personally decided to trust.

iudqnolq · on Dec 10, 2019

I agree completely. I merely wanted to point out that your reply that the problem is really hard was in reply to a person who solved the trust problem using the tools signal provides by having a simple problem (will always meet people in person first) and doing some work (verifying safety numbers).

riter · on Dec 10, 2019

This is effectively true, baseline it is indistinguishable from a "real" person, questioning the security model of openness and potential for mass social engineering. There is a reason why the hurdle of overcoming scaling the creation of a "real" phone number is difficult.

kaens · on Dec 10, 2019

nothing aside from things that require actual human interaction, so basically nothing.

juskrey · on Dec 10, 2019

This is not surveillance, this is just another channel crawler.

jimws · on Dec 10, 2019

Yes, the project is disingenuous. It claims there is a bug but really it is just functioning as designed.

riter · on Dec 10, 2019

I wholly admit the title was a bit charged and I appreciate the correction from the mod(s). That said, it questions the security model of privacy apps and the nature of open source. As I mentioned above, there is a reason they make it a hurdle to scale account creation.

It represents more than just a crawler if you have an imagination. That said, I was looking for a solution for one of the listed use-cases and none existed so I did something about it and shared it and now one exists. It takes no creativity to deride a work, and enough to make one and put it out there.

Getting both sides of the feedback has been a fun learning experience and looking forward to putting more out given the amount of feedback! Thank you both :)

EncryptEntropy · on Dec 10, 2019

I maintain some medium-sized TG channels and constantly have what we have deemed “surveillance” accounts join daily. One of the admins of the chan implemented a simple turing test bot which requires immediate 60 sec solving of a basic math equation or the account gets kicked. They were solving the “click this button to verify” but none at all seemingly know how to solve 6+4, or they can’t read English quickly enough.

big_chungus · on Dec 10, 2019

A few discord groups I've joined use this bot to verify through re captcha (which I hate, but is still a potential solution): https://auttaja.io/

riter · on Dec 10, 2019

Yes, the click button is on the todo. I have encountered these and the common one is the button click followed by the basic arithmetic. Solvable but anything beyond this would "defeat" it.

Tepix · on Dec 10, 2019

Unrelated but I wish Telegram would support encrypted group chats like Signal does.

Legogris · on Dec 10, 2019

And, at least as importantly, multi-device end-to-end encryption. E2E conversations on Telegram are almost as rare as on FB Messenger, IME.

fastball · on Dec 10, 2019

Security vs. Usability. Multi-device E2E requires ONE device be the source of truth for the private key. That's why for WhatsApp Desktop to work, you need your phone to be connected. This defeats the purpose for most people.

kelnos · on Dec 10, 2019

Does it, though? I feel like most people just use their phone, and that's it, and people who use the web version don't mind so much that they need their phone on the internet, because it pretty much always is anyway.

Sure, there are edge cases, like being on a plane and only having bought internet access for your phone, phone battery dead and no charging cable, phone lost/stolen, but those things seem rare enough that most people just live with it.

I would prefer that I didn't need my phone to use WhatsApp Web, but in practice it hasn't kept me from using WhatsApp (mobile or web).

crypt1d · on Dec 10, 2019

for me, its one of the reasons why I prefer Telegram over WhatsApp. There have been numerous cases when my phone was dead or offline (being abroad/roaming, battery dead, etc) and I wanted to use my laptop to finish that important conversation. The other one is lack of a native Linux desktop client for WhatsApp.

kelnos · on Dec 10, 2019

Right, I get that, and even mentioned one of the failure modes you mention. But I think for the average user (which more or less disqualifies most of our experiences here), it's not even remotely a deal-breaker.

vetinari · on Dec 10, 2019

Threema works similarly, but I rarely use web version, because it is annoying.

exo762 · on Dec 10, 2019

Transition from this:

> Multi-device E2E requires ONE device be the source of truth for the private key.

To this:

> That's why for WhatsApp Desktop to work, you need your phone to be connected.

Is just an implementation detail. Signal gets around this by letting server know about person's devices so that each device can sync independently of others. Phone holds ultimate key, but messages do not need to be routed through phone.

Reelin · on Dec 10, 2019

> Multi-device E2E requires ONE device be the source of truth for the private key.

That doesn't sound right to me. It may well be the case for a particular implementation, but I don't see why it would hold for the general case.

zahllos · on Dec 10, 2019

The way Wire does this, if I recall correctly, is that logging into your account (proving to the server you can authenticate) allows you to push up a new device public key, which you can see as your device fingerprint in the UI. A conversation between two users with multiple devices is really a group chat, with pairwise ("client fan out") double ratchet sessions going on.

When you do this, your other devices are informed of the account change by the server, as are people you communicate with (if they've previously marked your account as trusted, changing any devices on your account changes that). This isn't much different to Signal: ultimately the server acts as a key directory in both cases.

The problem with this approach is that it doesn't scale well at all. This is why Facebook, Wire etc are working on MLS (Messaging Layer Security) which is basically "add trees" so group chat scales better.

Reelin · on Dec 10, 2019

Thanks for explaining, that makes much more sense.

It seems to me that double ratchet is really to blame here. Without it, you could simply share a single key across all devices. With it, your choice is to either deal with this sort of complexity or to set up a trusted proxy in the middle.

It's a bit strange actually. There's this constant mantra of having to pick either security or usability. We now have readily available means for usable _and_ reasonably secure E2E, but the crypto nuts go and add additional "must haves" that once again make it difficult for the average person to use.

An aside: Instead of authenticating with a central server to add a key (as you've described Wire doing), why not handle this client side via X.509 certificate chains? This is very mature crypto and seems far more flexible. It would enable use of standard PKI token hardware for managing your root identity, allow fully offline enrollment of new devices, and provide cross signing for various purposes (changing your root identity, setting up a web of trust with a group, integrating with a corporate environment, etc).

Legogris · on Dec 10, 2019

No, it does not. Keybase does it pretty well. Signal is pretty close. Even the WhatsApp approach (require phone to be online, which is how it differs from Signal) is better than what Telegram has today (not even possible).

noxer · on Dec 10, 2019

The whole purpose of cloud chats would be broken if e2e would be implemented this way. Search for example basically impossible to implement anywhere near the way telegram does it.

Also from all my group chats and I have hundreds 99% are public or semi public (not searchable in telegram but can be found on the internet) it make no sense to encrypt this.

gsich · on Dec 10, 2019

Artificial limitation, not a technical one.

sschueller · on Dec 10, 2019

This is very cool. I wish there was something like this for whatsapp so I can forward my messages to another IM service.

heroprotagonist · on Dec 10, 2019

Twilio has a WhatsApp integration. It's paid per message, but it's pretty cheap if you're just forwarding a single user's messages. Probably a bit expensive to do at scale though.

I'm not sure what kind of open source apps leverage it, but I would guess there is something.

koalalorenzo · on Dec 10, 2019

Matrix?

mosselman · on Dec 10, 2019

Is matrix the other Im service or the tool with which you can forward WhatsApp messages or both?

BubuIIC · on Dec 10, 2019

Matrix is the other IM service. This is the tool for getting whatsapp chats into matrix: https://github.com/tulir/mautrix-whatsapp

jhabdas · on Dec 10, 2019

Wait until EU gets their data portability laws ironed out.

ailideex · on Dec 10, 2019

Yeah, I mean they really ironed cookies out something fierce, with every website now spamming us multiple times a week probably asking if we want to store cookies. So ironed out.

Nextgrid · on Dec 10, 2019

Websites that do that are actually in breach of the regulation.

The problem is that they're allowed to get away with it and the "4% of global turnover" fines are yet to be seen.

neiman · on Dec 10, 2019

Is there any positive ethical aspect of this tool?

ArnoVW · on Dec 10, 2019

Eliminating information dissymmetry, by showing to the rest of the world what -given the simplicity- undoubtedly many other people have already found out (but have kept for themselves).

notduncansmith · on Dec 10, 2019

For that to hold reasonable ethical weight, you’d need to spread wide awareness of this tool. I’d say you qualify for that if, for example, you were to run this bot in some kind of “warning mode” that would join unprotected groups and let participants know of their danger.

riter · on Dec 10, 2019

Fair point. At the very least it highlights the security model of OSS in general.

yoavm · on Dec 10, 2019

Not a fan of Telegram, but as far I understand this only let's you scrape messages from public channels. People shouldn't really expect anything else when you write a message on a group with 100s of other people

DavideNL · on Dec 10, 2019

Exactly, and this tool even joins the channel. I mean what else do you expect, it's not a surprise that anyone who joins a channel can read the messages.

riter · on Dec 10, 2019

Not untrue. And its one ngnix ec2 instance away from silently changing it from anyone who joins a channel to read the message to the world indexing all those messages.

22c · on Dec 10, 2019

Sure, drawing attention to the weaknesses of Telegram.

Similar discussion from previous HN thread:

https://news.ycombinator.com/item?id=20776327

diffeomorphism · on Dec 10, 2019

So what is the weakness here? It just seems to say public channels can be read by anyone, duh.

hnarn · on Dec 10, 2019

The README clearly states:

>Potential Business Applications:

>Sock puppeteering to overthrow a despotic regime

>Brand monitoring and sentiment analysis

>Shilling cryptocurrency at a moments notice for financial gain

>Influencing sentiment on topical issues

>Getting in on price action early

>Running analysis of a telegram channel

While some of them are arguably unethical, some of them are almost certainly not.

riter · on Dec 10, 2019

Some of these applications were intended to be tongue in cheek and point out ethical applications were up to the imagination.

BickNowstrom · on Dec 10, 2019

Gathering intelligence on - and data from - bad actors.

Do a search for "site:telegram.me" including a keyword from any illegal activity, such as carding, and you'll find hundreds of channels with interesting behavior.

kome · on Dec 10, 2019

I mean, it's a bot. Like a search engine bot. It works only for public groups or if it's allowed in a group...

I don't see any negative ethical aspect to be honest.

riter · on Dec 10, 2019

The easiest negative ethical aspect I would point out is in chat apps like discord or TG we have a reasonable expectation of privacy that is not necessarily implicit in the APIs. It could be one ec2 free-tier instance away from making all that information indexable.

ShorsHammer · on Dec 10, 2019

Depends who you think should be able to bomb people based on metadata I guess.

It's all relative.

jhabdas · on Dec 10, 2019

> We Kill People Based on Metadata

Former head of the National Security Agency Gen. Michael Hayden

DagAgren · on Dec 10, 2019

Keeping an eye on cryptocurrency scammer channels?

walrus01 · on Dec 10, 2019

Exposing fundamental architectural flaws of Telegram?

fastball · on Dec 10, 2019

Public groups being publicly accessible is an architectural flaw?

riter · on Dec 10, 2019

The ability to autoscale accounts with the generation of "real" phone numbers breaks the model

tapland · on Dec 10, 2019

It's not. You could also mass surveil HN since it's public.

wk0 · on Dec 10, 2019

Going to hijack this thread to plug some basic Telegram API tutorials I've written on Medium

* Send and Receive Messages with the Telegram API https://medium.com/@wk0/send-and-receive-messages-with-the-t...

* Running a Serverless Telegram Bot from AWS Lambda https://medium.com/@wk0/running-a-serverless-telegram-bot-fr...

* Integrating Your Serverless Telegram Bot with AWS API Gateway (published today) https://medium.com/@wk0/integrating-your-serverless-telegram...

riter · on Dec 11, 2019

good on ya ;)

captn3m0 · on Dec 10, 2019

Looks great. I found out about https://docs.telethon.dev/en/latest/, which looks like a usable Python API for Telegram user accounts (not bot API).

I can finally build by telegram-to-rss project!

Amir6 · on Dec 10, 2019

It would be great if someone can do an opensource like that, as mentioned by others Telegram is not privacy friendly but having access to the content that spreads around in such platform can be very valuable

riter · on Dec 10, 2019

Sweet! This point of this project was I found no boilerplate for one of the listed use cases so I just built it in 2 days and later shared it. I hope you do the same :)

noxer · on Dec 10, 2019

This is so stupid. Public channels and Groups have a web frontend it doesn't even need a telegram account to see or crawl them. Example link: https://t.me/s/durov/110

smashah · on Dec 10, 2019

Wow nice work. I'm sure Rukmini Callimachi will love this!

HNLurker2 · on Dec 10, 2019

>Callimachi left Romania during the communist regime with her mother and grandmother, for Switzerland and then the United States.

Inspiring actions

smashah · on Dec 10, 2019

She does amazing work reporting on the Isis assholes on telegram

riter · on Dec 10, 2019

That is actually an awesome and "ethical" use case for this project. Thank you for sharing who this journalist is and sharing ideas on ethical application.

thomasfl · on Dec 10, 2019

This makes it so much easier to keep track of the action on "The pump & dump cryptocurrency club" channel on Telegram.

riter · on Dec 10, 2019

Hahaha.. yes, if you can't tell this was a boiler plate for tracking and analyzing the activity of the wonderful seedy world of crypto.

riter · on Dec 10, 2019

TBH what I find most humorous is no one mentioning the obscure 90s reference :)

jhabdas · on Dec 11, 2019

Yo snow the five-o came around looking for you the other day...

surajs · on Dec 10, 2019

lemme just say ... ha-ha!