☆

1

The idea of using a hierarchical clustering of nodes in order to guide network embedding is very nice!
### Related work:
Hierarchical clustering for network embedding has also been used in the two following papers:
- [HARP: Hierarchical Representation Learning for Networks](https://papers-gamma.link/paper/109)
- [MILE: A Multi-Level Framework for Scalable Graph Embedding](https://papers-gamma.link/paper/104)
### Scalability:
The largest graph used in the experiments has only 334k edges and the running time on that network is not reported. In figure 7, the method takes 2 minutes on a BA network of less than 10k nodes and 150k edges.
It would have been nice to report the running time on a graph of 1G edges in order to evaluate the scalability in a better way.
### Reproducibility:
The implementation does not seem to be publically available and as the proposed method is complicated, the results are hard to reproduce.
### Table 3:
It is interesting that the dot-product approach leads to the best results for link prediction in comparison with the 4 strategies used in node2vec. Usualy, Hadamard leads to the best results.

Comments to a comment:
1. Code is available at https://github.com/JianxinMa/jianxinma.github.io/blob/master/assets/nethiex-linux.zip at the moment. It will probably be put on http://nrl.thumedialab.com/ later.
Yeah the implementation is quite complicated. Still, a lot simpler than the variational inference for the nested chinese restaurant process (which I can't find any implementation).
2. Scalability shouldn't be a problem in theory (the time complexity is linear per iteration in theory). But the released code is not really optimized (not even multi-threaded or distributed).
3. It's possible for Hadamard to be worse than dot product, as it requires training a classifier and can over-fit if not tuned carefully or trained on a small sample.

☆

1

### Style
I like the Levin style, he writes a bit provocative phrases that looks, in the same time, incredibly truthful (at least for the author himself). Consider for example what he says about the invention of positional numeral systems and Quantum computers:
> Archimedes made a great discovery that digital representation of numbers is exponentially
> more efficient than analog ones (sand pile sizes). Many subsequent analog devices yielded
> unimpressive results. It is not clear why QCs [quantum computers] should be an exception.
### One-way function and the axiom of choice
One-way functions and the axiom of choice deals with practical
or conceptual (im-)possibility to solve [inverse problems](https://en.wikipedia.org/wiki/Inverse_problem) arising in Computer Science and Mathematics.
### Kolmogorov complexity and One-way functions
Two primes $p$ and $q$ have almost the same [informational content](https://en.wikipedia.org/wiki/Algorithmic_information_theory) as their product $pq$.
However, the [$p, q \mapsto pq$](https://en.wikipedia.org/wiki/Computational_complexity_of_mathematical_operations#Arithmetic_functions) is much more easy to do than
[$pq \mapsto p,q$](https://en.wikipedia.org/wiki/Integer_factorization#Prime_decomposition).

## Comments: